Flexible Ontology Population from Text: The OwlExporter

Abstract

Ontology population from text is becoming increasingly important for NLP applications. Ontologies in OWL format provide a standardized means of modeling, querying, and reasoning over large knowledge bases. Populated from natural language texts, they offer significant advantages over traditional export formats, such as plain XML. The development of text analysis systems has been greatly facilitated by modern NLP frameworks, such as the General Architecture for Text Engineering (GATE). However, ontology population is not currently supported by a standard component. We developed a GATE resource called the OwlExporter that makes it easy to map existing NLP analysis pipelines to OWL ontologies, so that language engineers can create ontology population systems without extensive knowledge of ontology APIs. A particular feature of our approach is the concurrent population and linking of a domain ontology and an NLP ontology, including NLP-specific features such as safe reasoning over coreference chains.

Reference

René Witte, Ninus Khamis, and Juergen Rilling. Flexible Ontology Population from Text: The OwlExporter. The Seventh International Conference on Language Resources and Evaluation (LREC 2010), pp. 3845–3850, May 19–21, 2010, Valletta, Malta.

Bibtex entry (also for download):

@inproceedings{wkr10,
  author = {Ren{\'e} Witte and Ninus Khamis and Juergen Rilling},
  title = {{Flexible Ontology Population from Text: The OwlExporter}},
  booktitle = {The Seventh International Conference on Language Resources and Evaluation (LREC 2010)},
  pages = {3845--3850},
  year = {2010},
  month = {May 19--21},
  publisher = {ELRA},
  isbn = {2-9517408-6-7},
  address = {Valletta, Malta}
}

Software

The OwlExporter GATE component described in this paper is available as open source software.

Download

Our paper only: 932_Paper.pdf. Also available: local copy.
MD5 checksum: 12b99fd673278b05a50af2a6609f4f54
You can also browse the complete conference proceedings.
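
To check a downloaded copy against the MD5 checksum listed above, a short script along these lines may help. It is a minimal sketch that uses only Python's standard `hashlib` module; the helper names `md5_of_file` and `matches_expected` are illustrative and not part of the OwlExporter distribution.

```python
import hashlib

# Checksum as published on this page for the paper PDF.
EXPECTED_MD5 = "12b99fd673278b05a50af2a6609f4f54"


def md5_of_file(path, chunk_size=65536):
    """Return the hex MD5 digest of the file at `path`, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_MD5):
    """Compare the file's digest with `expected`, ignoring letter case."""
    return md5_of_file(path) == expected.lower()
```

Assuming the paper was saved as 932_Paper.pdf in the current directory, `matches_expected("932_Paper.pdf")` should return `True` for an intact download. MD5 is adequate for spotting a corrupted transfer but is not a safeguard against deliberate tampering.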

Copyright © 2010 René Witte, Ninus Khamis, and Juergen Rilling. All rights reserved.