The University of Southampton
University of Southampton Institutional Repository

Web based Knowledge Extraction and Consolidation for Automatic Ontology Instantiation

Web based Knowledge Extraction and Consolidation for Automatic Ontology Instantiation
Web based Knowledge Extraction and Consolidation for Automatic Ontology Instantiation
The Web is probably the largest and richest information repository available today. Search engines are the common access routes to this valuable source. However, the role of these search engines is often limited to the retrieval of lists of potentially relevant documents. The burden of analysing the returned documents and identifying the knowledge of interest is therefore left to the user. The Artequakt system aims to deploy natural language tools to automatically extract and consolidate knowledge from web documents and instantiate a given ontology, which dictates the type and form of knowledge to extract. Artequakt focuses on the domain of artists, and uses the harvested knowledge to generate tailored biographies. This paper describes the latest developments of the system and discusses the problem of knowledge consolidation.
Information Extraction, Ontology Instantiation, Knowledge Consolidation
Alani, Harith
70cdbdce-1494-44c2-9dae-65d82bf7e991
Kim, Sanghee
9e0e5909-9fbe-4c37-9606-2fdea35eac12
Millard, David E.
4f19bca5-80dc-4533-a101-89a5a0e3b372
Weal, Mark J.
57f33b97-406c-4008-a7be-0f2cd30689f5
Hall, Wendy
11f7f8db-854c-4481-b1ae-721a51d8790c
Lewis, Paul H.
7aa6c6d9-bc69-4e19-b2ac-a6e20558c020
Shadbolt, Nigel
5c5acdf4-ad42-49b6-81fe-e9db58c2caf7
Alani, Harith
70cdbdce-1494-44c2-9dae-65d82bf7e991
Kim, Sanghee
9e0e5909-9fbe-4c37-9606-2fdea35eac12
Millard, David E.
4f19bca5-80dc-4533-a101-89a5a0e3b372
Weal, Mark J.
57f33b97-406c-4008-a7be-0f2cd30689f5
Hall, Wendy
11f7f8db-854c-4481-b1ae-721a51d8790c
Lewis, Paul H.
7aa6c6d9-bc69-4e19-b2ac-a6e20558c020
Shadbolt, Nigel
5c5acdf4-ad42-49b6-81fe-e9db58c2caf7

Alani, Harith, Kim, Sanghee, Millard, David E., Weal, Mark J., Hall, Wendy, Lewis, Paul H. and Shadbolt, Nigel (2003) Web based Knowledge Extraction and Consolidation for Automatic Ontology Instantiation. Knowledge Capture (K-Cap'03), Workshop on Knowledge Markup and Semantic Annotation, Sanibel Island, Florida, United States.

Record type: Conference or Workshop Item (Other)

Abstract

The Web is probably the largest and richest information repository available today. Search engines are the common access routes to this valuable source. However, the role of these search engines is often limited to the retrieval of lists of potentially relevant documents. The burden of analysing the returned documents and identifying the knowledge of interest is therefore left to the user. The Artequakt system aims to deploy natural language tools to automatically extract and consolidate knowledge from web documents and instantiate a given ontology, which dictates the type and form of knowledge to extract. Artequakt focuses on the domain of artists, and uses the harvested knowledge to generate tailored biographies. This paper describes the latest developments of the system and discusses the problem of knowledge consolidation.

Text
Alani-SEMANNOT-camera-ready.pdf - Other
Download (390kB)

More information

Published date: 2003
Additional Information: Event Dates: October 26
Venue - Dates: Knowledge Capture (K-Cap'03), Workshop on Knowledge Markup and Semantic Annotation, Sanibel Island, Florida, United States, 2003-10-26
Keywords: Information Extraction, Ontology Instantiation, Knowledge Consolidation
Organisations: Web & Internet Science

Identifiers

Local EPrints ID: 258325
URI: http://eprints.soton.ac.uk/id/eprint/258325
PURE UUID: 4d72c4ef-6f72-4096-85cf-d5b3d81c096f
ORCID for David E. Millard: ORCID iD orcid.org/0000-0002-7512-2710
ORCID for Wendy Hall: ORCID iD orcid.org/0000-0003-4327-7811

Catalogue record

Date deposited: 18 Oct 2003
Last modified: 15 Mar 2024 02:58

Export record

Contributors

Author: Harith Alani
Author: Sanghee Kim
Author: David E. Millard ORCID iD
Author: Mark J. Weal
Author: Wendy Hall ORCID iD
Author: Paul H. Lewis
Author: Nigel Shadbolt

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×