The Impact of Enriched Linguistic Annotation on the Performance of Extracting Relation Triples


Kim, Sanghee, Lewis, Paul and Martinez, Kirk, Gelbukh, Alexander (ed.) (2004) The Impact of Enriched Linguistic Annotation on the Performance of Extracting Relation Triples. Computational Linguistics and Intelligent Text Processing, Lectur, (2945), 547-558.

Description/Abstract

A relation extraction system recognises pre-defined relation types between two identified entities in natural language documents. This is important for the task of automatically locating missing instances in a knowledge base, where each instance is represented as a triple (‘entity – relation – entity’). A relation entry specifies a set of rules associated with the syntactic and semantic conditions under which appropriate relations are extracted. Creating such rules manually requires knowledge from information experts; moreover, it is time-consuming and error-prone when the input sentences have little consistency in structure and vocabulary. In this paper, we present an approach that applies a symbolic learning algorithm to sentences in order to automatically induce extraction rules, which then successfully classify new sentences. The proposed approach takes into account semantic attributes (e.g., semantically close words) as well as linguistic features (entity types) when generalising common patterns among the sentences, which enables the system to cope better with syntactically different but semantically similar sentences. Not only does this increase the number of relations extracted, but it also improves the accuracy of extraction by adding features that might not be discovered through syntactic analysis alone. Experimental results show that this approach is effective on sentences from Web documents, obtaining 17% higher precision and 34% higher recall.
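
To make the triple representation and the idea of a rule with syntactic and semantic conditions concrete, here is a minimal sketch. It is not the paper's system: the rule is written by hand rather than induced by a learning algorithm, and the names `Triple`, `Rule`, `Sentence`, `apply_rule`, the `place_of_birth` relation, and the trigger words are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Triple:
    """An 'entity - relation - entity' instance."""
    subject: str
    relation: str
    obj: str


@dataclass(frozen=True)
class Rule:
    """A hand-written stand-in for an extraction rule (hypothetical format)."""
    relation: str
    subject_type: str          # required type of the first entity
    object_type: str           # required type of the second entity
    trigger_words: frozenset   # semantically close words signalling the relation


@dataclass(frozen=True)
class Sentence:
    tokens: tuple              # word tokens
    entities: tuple            # (text, entity_type) pairs in order of appearance


def apply_rule(rule: Rule, sentence: Sentence) -> Optional[Triple]:
    """Return a triple if the sentence satisfies every condition of the rule."""
    if len(sentence.entities) != 2:
        return None
    (s_text, s_type), (o_text, o_type) = sentence.entities
    # Linguistic condition: both entity types must match the rule.
    if (s_type, o_type) != (rule.subject_type, rule.object_type):
        return None
    # Semantic condition: at least one token is a known trigger word.
    if not any(tok.lower() in rule.trigger_words for tok in sentence.tokens):
        return None
    return Triple(s_text, rule.relation, o_text)


# Illustrative data only; not taken from the paper's experiments.
rule = Rule("place_of_birth", "Person", "Location",
            frozenset({"born", "birthplace", "native"}))
match = Sentence(tuple("Picasso was born in Malaga".split()),
                 (("Picasso", "Person"), ("Malaga", "Location")))
no_match = Sentence(tuple("Picasso visited Malaga".split()),
                    (("Picasso", "Person"), ("Malaga", "Location")))

extracted = apply_rule(rule, match)
rejected = apply_rule(rule, no_match)
```

In this sketch the set of trigger words plays the role of the semantically close words mentioned in the abstract: adding a synonym to the set lets the same rule cover a sentence that is worded differently but means the same thing.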

Item Type: Article
ISSNs: 030297343
Keywords: relation extraction, information extraction, inductive logic programming
Divisions: Faculty of Physical and Applied Science > Electronics and Computer Science > Web & Internet Science
Item ID: 258880
Date Deposited: 25 Feb 2004
Last Modified: 18 Aug 2012 03:24
Contributors: Kim, Sanghee (Author)
Lewis, Paul (Author)
Martinez, Kirk (Author)
Gelbukh, Alexander (Editor)
Date: February 2004
Status: Published
Publisher: Springer
ISI Citation Count:0
URI: http://eprints.soton.ac.uk/id/eprint/258880
