The University of Southampton
University of Southampton Institutional Repository

Combining POS tagging, lucene search and similarity metrics for entity linking

Combining POS tagging, lucene search and similarity metrics for entity linking
Combining POS tagging, lucene search and similarity metrics for entity linking
Entity linking is to detect proper nouns or concrete concepts (a.k.a mentions) from documents, and to map them to the corresponding entries in a given knowledge base. In this paper, we propose an entity linking framework POSLS consisting of three components: mention detection, candidate selection and entity disambiguation. First, we use part of speech tagging and English syntactic rules to detect mentions. We then choose candidates with Lucene search. Finally, we identify the best matchings with a similarity based disambiguation method. Experimental results show that our approach has an acceptable accuracy.
entity linking, pos tagging, lucene search, similarity metrics, mention detection
978-3-642-41229-5
8180
503-509
Springer
Zhao, Shujuan
57419c07-b419-4f03-af06-6828e53db834
Li, Chune
9922e361-dbe6-47a3-8502-ff749152b375
Ma, Shuai
232b6412-5735-42cd-9cbc-80352bff65ce
Ma, Tiejun
1f591849-f17c-4209-9f42-e6587b499bae
Ma, Dianfu
96986a0e-15d4-4b81-8482-59b8ed03863d
Zhao, Shujuan
57419c07-b419-4f03-af06-6828e53db834
Li, Chune
9922e361-dbe6-47a3-8502-ff749152b375
Ma, Shuai
232b6412-5735-42cd-9cbc-80352bff65ce
Ma, Tiejun
1f591849-f17c-4209-9f42-e6587b499bae
Ma, Dianfu
96986a0e-15d4-4b81-8482-59b8ed03863d

Zhao, Shujuan, Li, Chune, Ma, Shuai, Ma, Tiejun and Ma, Dianfu (2013) Combining POS tagging, lucene search and similarity metrics for entity linking. In Web Information Systems Engineering – WISE 2013. Springer. pp. 503-509 . (doi:10.1007/978-3-642-41230-1_44).

Record type: Conference or Workshop Item (Paper)

Abstract

Entity linking is to detect proper nouns or concrete concepts (a.k.a mentions) from documents, and to map them to the corresponding entries in a given knowledge base. In this paper, we propose an entity linking framework POSLS consisting of three components: mention detection, candidate selection and entity disambiguation. First, we use part of speech tagging and English syntactic rules to detect mentions. We then choose candidates with Lucene search. Finally, we identify the best matchings with a similarity based disambiguation method. Experimental results show that our approach has an acceptable accuracy.

This record has no associated files available for download.

More information

Published date: 2013
Venue - Dates: 14th International Conference Web Information Systems Engineering (WISE), Nanjing, China, 2013-10-13 - 2013-10-15
Keywords: entity linking, pos tagging, lucene search, similarity metrics, mention detection
Organisations: Southampton Business School

Identifiers

Local EPrints ID: 366855
URI: http://eprints.soton.ac.uk/id/eprint/366855
ISBN: 978-3-642-41229-5
PURE UUID: 725dc259-c0d2-49fe-9bc7-9bb17d9dd1a4

Catalogue record

Date deposited: 15 Jul 2014 11:28
Last modified: 14 Mar 2024 17:18

Export record

Altmetrics

Contributors

Author: Shujuan Zhao
Author: Chune Li
Author: Shuai Ma
Author: Tiejun Ma
Author: Dianfu Ma

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×