Discovering cross-language links in Wikipedia through semantic relatedness


Penta, Antonio, Quercini, Gianluca, Reynaud, Chantal and Shadbolt, Nigel (2012) Discovering cross-language links in Wikipedia through semantic relatedness. In, 20th European Conference on Artificial Intelligence (ECAI 2012), Montpellier, FR, 27 - 31 Aug 2012.

Description/Abstract

Wikipedia is a large multilingual collection of interlinked articles, used and contributed to by millions of users over the Internet, that provides editions in up to 283 languages. Two articles in different language versions of Wikipedia may describe exactly the same concept, in which case they are often connected through a cross-language link. However, many cross-language links are either missing or incorrect, and this negatively affects both the readers of Wikipedia and multilingual information retrieval applications. In this paper, we propose WikiCL, an algorithm for discovering cross-language links using the semantic relatedness of two articles derived from the Wikipedia graph structure. Our evaluation shows that we achieve comparable, and in some cases better, results than previous methods with much less computational time.

Item Type: Conference or Workshop Item (Paper)
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
H Social Sciences > HE Transportation and Communications
Divisions: Faculty of Physical Sciences and Engineering > Electronics and Computer Science
ePrint ID: 340145
Date Deposited: 13 Jun 2012 13:16
Last Modified: 27 Mar 2014 20:22
URI: http://eprints.soton.ac.uk/id/eprint/340145
