Creating ontologies for content representation - the OntoSeed suite
Creating ontologies for content representation - the OntoSeed suite
Due to the inherent difficulties associated with manual ontology building, knowledge acquisition approaches such as ontology reuse or ontology learning from texts are often seen as instruments that can make this tedious process easier. In this paper we present a NLP-based method to aid ontology design in a specific application scenario, namely that in which the resulting ontology is used to support the semantic annotation of text documents. The proposed method uses the World Wide Web in its analysis of the domain-specific documents, thereby greatly reducing the need for linguistic expertise and resources, and suggests ways to specify domain ontologies in a “linguistics-friendly” format in order to improve further ontology-based natural language processing tasks such as semantic annotation. We present a thorough evaluation of the method, using corpora from three diverse real-world settings (medical information, tourism, and recipes). Additionally, for the first scenario we compare the costs and the benefits of the NLP-based ontology engineering approach against a similar, reuse-oriented experiment.
141-166
Simperl, E.
40261ae4-c58c-48e4-b78b-5187b10e4f67
Schlangen, D.
ff1e2e45-bdd4-46ca-b82f-89095a26cd78
2007
Simperl, E.
40261ae4-c58c-48e4-b78b-5187b10e4f67
Schlangen, D.
ff1e2e45-bdd4-46ca-b82f-89095a26cd78
Simperl, E. and Schlangen, D.
(2007)
Creating ontologies for content representation - the OntoSeed suite.
Journal of Data Semantics IX, 4601, .
(doi:10.1007/978-3-540-74987-5_5).
Abstract
Due to the inherent difficulties associated with manual ontology building, knowledge acquisition approaches such as ontology reuse or ontology learning from texts are often seen as instruments that can make this tedious process easier. In this paper we present a NLP-based method to aid ontology design in a specific application scenario, namely that in which the resulting ontology is used to support the semantic annotation of text documents. The proposed method uses the World Wide Web in its analysis of the domain-specific documents, thereby greatly reducing the need for linguistic expertise and resources, and suggests ways to specify domain ontologies in a “linguistics-friendly” format in order to improve further ontology-based natural language processing tasks such as semantic annotation. We present a thorough evaluation of the method, using corpora from three diverse real-world settings (medical information, tourism, and recipes). Additionally, for the first scenario we compare the costs and the benefits of the NLP-based ontology engineering approach against a similar, reuse-oriented experiment.
This record has no associated files available for download.
More information
Published date: 2007
Organisations:
Web & Internet Science
Identifiers
Local EPrints ID: 351608
URI: http://eprints.soton.ac.uk/id/eprint/351608
PURE UUID: f792b8d9-529d-41df-925e-0a8b792860bb
Catalogue record
Date deposited: 13 May 2013 14:31
Last modified: 14 Mar 2024 13:41
Export record
Altmetrics
Contributors
Author:
D. Schlangen
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics