Title: Searching for linguistic phenomena in literary digital libraries
Authors: Sánchez Martínez, Felipe
Forcada Zubizarreta, Mikel L.
Carrasco Jiménez, Rafael Carlos
Date: 2013-03-25
2008-09
Publisher: RUA Docencia
Source:
Type: info:eu-repo/semantics/conferenceObject
Subject: Machine translation
Search engine
Lucene
Morphological information
Digital libraries
Lenguajes y Sistemas Informáticos
Description: This paper describes a set of tools and Java classes that allow the Lucene text search engine to use morphological information to index and search; in particular, it describes the use of the linguistic resources developed for the Apertium open-source machine translation platform to extract morphological information while indexing. We describe which linguistic information is automatically obtained, how to use it when indexing new documents with Lucene, and how linguistic attributes can be used to specify query terms. The use of morphological information makes it possible to search for specific linguistic phenomena, and to explore in a richer way the cultural heritage in current digital libraries.
Language: English

Similar articles:

Choosing the correct paradigm for unknown words in rule-based machine translation systems, by Sánchez Cartagena, Víctor Manuel; Esplà Gomis, Miquel; Sánchez Martínez, Felipe; Pérez Ortiz, Juan Antonio
Using external sources of bilingual information for on-the-fly word alignment, by Esplà Gomis, Miquel; Sánchez Martínez, Felipe; Forcada Zubizarreta, Mikel L.