Web based Knowledge Extraction and Consolidation for Automatic Ontology Instantiation
The Web is probably the largest and richest information repository available today. Search engines are the common access routes to this valuable source. However, the role of these search engines is often limited to the retrieval of lists of potentially relevant documents. The burden of analysing the returned documents and identifying the knowledge of interest is therefore left to the user. The Artequakt system aims to deploy natural language tools to automatically extract and consolidate knowledge from web documents and instantiate a given ontology, which dictates the type and form of knowledge to extract. Artequakt focuses on the domain of artists, and uses the harvested knowledge to generate tailored biographies. This paper describes the latest developments of the system and discusses the problem of knowledge consolidation.
Information Extraction, Ontology Instantiation, Knowledge Consolidation
Alani, Harith
70cdbdce-1494-44c2-9dae-65d82bf7e991
Kim, Sanghee
9e0e5909-9fbe-4c37-9606-2fdea35eac12
Millard, David E.
4f19bca5-80dc-4533-a101-89a5a0e3b372
Weal, Mark J.
57f33b97-406c-4008-a7be-0f2cd30689f5
Hall, Wendy
11f7f8db-854c-4481-b1ae-721a51d8790c
Lewis, Paul H.
7aa6c6d9-bc69-4e19-b2ac-a6e20558c020
Shadbolt, Nigel
5c5acdf4-ad42-49b6-81fe-e9db58c2caf7
2003
Alani, Harith, Kim, Sanghee, Millard, David E., Weal, Mark J., Hall, Wendy, Lewis, Paul H. and Shadbolt, Nigel (2003) Web based Knowledge Extraction and Consolidation for Automatic Ontology Instantiation. Knowledge Capture (K-Cap'03), Workshop on Knowledge Markup and Semantic Annotation, Sanibel Island, Florida, United States.
Record type: Conference or Workshop Item (Other)
More information
Published date: 2003
Additional Information:
Event Dates: October 26
Venue - Dates:
Knowledge Capture (K-Cap'03), Workshop on Knowledge Markup and Semantic Annotation, Sanibel Island, Florida, United States, 2003-10-26
Organisations:
Web & Internet Science
Identifiers
Local EPrints ID: 258325
URI: http://eprints.soton.ac.uk/id/eprint/258325
PURE UUID: 4d72c4ef-6f72-4096-85cf-d5b3d81c096f
Catalogue record
Date deposited: 18 Oct 2003
Last modified: 15 Mar 2024 02:58