Title: Multilingual extension of a temporal expression normalizer using annotated corpora
Authors: Saquete Boró, Estela
Martínez Barco, Patricio
Muñoz Guillena, Rafael
Negri, Matteo
Speranza, Manuela
Sprugnoli, Rachele
Date: 2012-05-17
2006
Publisher: RUA Docencia
Source:
Type: info:eu-repo/semantics/conferenceObject
Subject: Multilingual extension
TERSEO
Temporal expression normalizer
Annotated corpora
Lenguajes y Sistemas Informáticos
Description: This paper presents the automatic extension to other languages of TERSEO, a knowledge-based system for the recognition and normalization of temporal expressions originally developed for Spanish. TERSEO was first extended to English through the automatic translation of the temporal expressions. Then, an improved porting process was applied to Italian, where the automatic translation of the temporal expressions from English and from Spanish was combined with the extraction of new expressions from an Italian annotated corpus. Experimental results demonstrate how, while still adhering to the rule-based paradigm, the development of automatic rule translation procedures allowed us to minimize the effort required for porting to new languages. Relying on such procedures, and without any manual effort or previous knowledge of the target language, TERSEO recognizes and normalizes temporal expressions in Italian with good results (72% precision and 83% recall for recognition).
This research was partially funded by the Spanish Government (contract TIC2003-07158-C04-01).
Language: English

Similar articles:

Choosing the correct paradigm for unknown words in rule-based machine translation systems, by Sánchez Cartagena, Víctor Manuel; Esplà Gomis, Miquel; Sánchez Martínez, Felipe; Pérez Ortiz, Juan Antonio
Using external sources of bilingual information for on-the-fly word alignment, by Esplà Gomis, Miquel; Sánchez Martínez, Felipe; Forcada Zubizarreta, Mikel L.