Title: An algorithm for anaphora resolution in Spanish texts
Authors: Palomar Sanz, Manuel
Ferrández Rodríguez, Antonio
Moreno Boronat, Lidia
Martínez Barco, Patricio
Peral Cortés, Jesús
Saiz Noeda, Maximiliano
Muñoz Guillena, Rafael
Date: 2007-10-28
2001-12
Publisher: RUA Docencia
Source:
Type: info:eu-repo/semantics/article
Subject: Anaphora resolution
Natural language
Lenguajes y Sistemas Informáticos
Description: This paper presents an algorithm for identifying noun phrase antecedents of third person personal pronouns, demonstrative pronouns, reflexive pronouns, and omitted pronouns (zero pronouns) in unrestricted Spanish texts. We define a list of constraints and preferences for different types of pronominal expressions, and we document in detail the importance of each kind of knowledge (lexical, morphological, syntactic, and statistical) in anaphora resolution for Spanish. The paper also provides a definition for syntactic conditions on Spanish NP-pronoun noncoreference using partial parsing. The algorithm has been evaluated on a corpus of 1,677 pronouns and achieved a success rate of 76.8%. We have also implemented four competitive algorithms and tested their performance in a blind evaluation on the same test corpus. This new approach could easily be extended to other languages such as English, Portuguese, Italian, or Japanese.
This work has been supported by the Spanish government (CICYT) with Grant TIC97-0671-C02-01/02.
Language: English

Similar articles:

Choosing the correct paradigm for unknown words in rule-based machine translation systems, by Sánchez Cartagena, Víctor Manuel; Esplà Gomis, Miquel; Sánchez Martínez, Felipe; Pérez Ortiz, Juan Antonio
Using external sources of bilingual information for on-the-fly word alignment, by Esplà Gomis, Miquel; Sánchez Martínez, Felipe; Forcada Zubizarreta, Mikel L.