Título: Converting Taxonomic Descriptions to New Digital Formats
Autores: cui, hong
Fecha: 2008-01-20
Publicador: Biodiversity Informatics
Fuente:
Tipo: info:eu-repo/semantics/article
Peer-reviewed Article
info:eu-repo/semantics/publishedVersion
Tema: Taxonomic descriptions, morphological descriptions, semantic markup, supervised machine learning, unsupervised machine learning, system evaluation, XML
Descripción: Abstract.--The majority of taxonomic descriptions is currently in print format. The majority of digital descriptions are in formats such as DOC, HTML, or PDF and for human readers. These formats do not convey rich semantics in taxonomic descriptions for computer-aided process. Newer digital formats such as XML and RDF accommodate semantic annotations that allow computers to process the rich semantics on human's behalf, thus open up opportunities for a wide range of innovative usages of taxonomic descriptions, such as searching in more precise and flexible ways, integrating with gnomic and geographic information, generating taxonomic keys automatically, and text data mining and information visualization etc. This paper discusses the challenges in automated conversion of multiple collections of descriptions to XML format and reports an automated system, MARTT. MARTT is a machine-learning system that makes use of training examples to tag new descriptions into XML format. A number of utilities are implemented as solutions to the challenges. The utilities are used to reduce the effort for training example preparation, to facilitate the creation of a comprehensive schema, and to predict system performance on a new collection of descriptions. The system has been tested with several plant and alga taxonomic publications including Flora of China and Flora of North America.
Idioma: Inglés

Artículos similares:

Global Biodiversity Informatics: setting the scene for a “new world” of ecological forecasting por Canhos, Vanderlei Perez; Centro de Referência em Informação Ambiental, CRIA,Souza, Sidnei de; Centro de Referência em Informação Ambiental, CRIA,Giovanni, Renato De; Centro de Referência em Informação Ambiental, CRIA,Canhos, Dora Ann Lange; Centro de Referência em Informação Ambiental, CRIA
Interpretation of Models of Fundamental Ecological Niches and Species’ Distributional Areas por Soberon, Jorge; CONABIO,Peterson, A. Townsend; Natural History Museum, KU
Place prioritization for biodiversity content using species ecological niche modeling por Sánchez-Cordero, Víctor; Departamento de Zoologia, Instituto de Biologia, Universidad Nacional Autonoma de Mexico.,Cirelli, Verónica; Departamento de Zoologia, Instituto de Biologia, Universidad Nacional Autonoma de Mexico.,Munguial, Mariana; Departamento de Zoologia, Instituto de Biologia, Universidad Nacional Autonoma de Mexico.,Sarkar, Sahotra; Section of Integrative Biology and Department of Philosophy, University of Texas
Environmental Information: Placing Biodiversity Phenomena in an Ecological and Environmental Context por Chapman, Arthur D; Australian Biodiversity information Services,Muñoz, Mauro E.S.; Centro de Referência em Informação Ambiental (CRIA),Koch, Ingrid; Centro de Referência em Informação Ambiental (CRIA),
Resolving taxonmic discrepancies: Role of Electronic Catalogues of Known Organisms por Chavan, Vishwas Shravan; National Chemical Laboratory,Rane, Nilesh Sunil; National Chemical Laboratory,Watve, Aparna; National Chemical Laboratory,Ruggiero, Michael; Integrated Taxonomic Information System, US Geological Survey, Smithsonian Institution, Washington DC, USA
Bioinformatics, the Clearing-House Mechanism and the Convention on Biological Diversity por Silva, Marcos R.; Secretariat of the Convention on Biological Diversity
TaxonGrab: Extracting Taxonomic Names From Text por Koning, Drew; American Museum of Natural History,Sarkar, Indra Neil; American Museum of Natural History,Moritz, Thomas; American Museum of Natural History
Mammals of the World: MaNIS as an example of data integration in a distributed network environment por Stein, Barbara R; Museum of Vertebrate Zoology,Wieczorek, John R.; Museum of Vertebrate Zoology
10