Title: Multi-Document Arabic Text Summarisation
Authors: El-Haj, Mahmoud
Kruschwitz, Udo
Fox, Chris
Date: 2011
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Source:
Type: Book Section
PeerReviewed
Subject: P Philology. Linguistics
QA75 Electronic computers. Computer science
Description: In this paper we present our generic extractive Arabic and English multi-document summarisers. We also describe the use of machine translation for evaluating the generated Arabic multi-document summaries against English extractive gold standards. We first address the lack of Arabic multi-document corpora for summarisation and the absence of automatic and manual Arabic gold-standard summaries, both of which are required to evaluate any automatic Arabic summariser. Second, we demonstrate the use of Google Translate in creating an Arabic version of the DUC-2002 dataset. The resulting parallel Arabic/English dataset is summarised using the Arabic and English summarisation systems. The automatically generated summaries are evaluated using the ROUGE metric, as well as precision and recall. The results we achieve are compared with those of the top five systems in the DUC-2002 multi-document summarisation task.
Language: Not applicable
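
Note: The description mentions scoring generated summaries with ROUGE together with precision and recall. As a rough illustration of the idea behind such n-gram overlap scores, the sketch below computes unigram precision and recall of a candidate summary against a single reference. It is not the authors' evaluation setup or the official ROUGE toolkit: the function names `ngram_counts` and `overlap_scores` are hypothetical, and lower-casing plus whitespace tokenisation is a simplifying assumption that would not be adequate for Arabic text.

```python
from collections import Counter


def ngram_counts(tokens, n):
    """Count the n-grams (as tuples) that occur in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def overlap_scores(candidate, reference, n=1):
    """Return (precision, recall) of n-gram overlap between two texts.

    Assumption: tokens are obtained by lower-casing and splitting on
    whitespace, with no stemming or stop-word removal.
    """
    cand = ngram_counts(candidate.lower().split(), n)
    ref = ngram_counts(reference.lower().split(), n)
    # Clipped overlap: each n-gram counts at most as often as it
    # appears in both the candidate and the reference.
    overlap = sum((cand & ref).values())
    precision = overlap / sum(cand.values()) if cand else 0.0
    recall = overlap / sum(ref.values()) if ref else 0.0
    return precision, recall


if __name__ == "__main__":
    p, r = overlap_scores("the cat sat on the mat",
                          "the cat lay on the mat today")
    print(f"precision={p:.3f} recall={r:.3f}")
```

Recall here is the share of reference n-grams recovered by the candidate, which is the quantity ROUGE-N is built around; precision is the share of candidate n-grams that also appear in the reference.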