Statistical analysis of the owl:sameAs network for aligning concepts in the linking open data cloud


Correndo, Gianluca, Penta, Antonio, Gibbins, Nicholas and Shadbolt, Nigel (2012) Statistical analysis of the owl:sameAs network for aligning concepts in the linking open data cloud. In, International Conference on Database and Expert Systems Applications (DEXA 2012 ), Wien, AT, 03 - 07 Sep 2012. 15pp. (doi:10.1007/978-3-642-32597-7_20).

Download

[img] PDF (Accepted verision) - Pre print
Download (570Kb)

Description/Abstract

The massively distributed publication of linked data has brought to the attention of scientific community the limitations of classic methods for achieving data integration and the opportunities of pushing the boundaries of the field by experimenting this collective enterprise that is the linking open data cloud. While reusing existing ontologies is the choice of preference, the exploitation of ontology alignments still is a required step for easing the burden of integrating heterogeneous data sets. Alignments, even between the most used vocabularies, is still poorly supported in systems nowadays whereas links between instances are the most widely used means for bridging the gap between different data sets. We provide in this paper an account of our statistical and qualitative analysis of the network of instance level equivalences in the Linking Open Data Cloud (i.e. the sameAs network) in order to automatically compute alignments at the conceptual level. Moreover, we explore the effect of ontological information when adopting classical Jaccard methods to the ontology alignment task. Automating such task will allow in fact to achieve a clearer conceptual description of the data at the cloud level, while improving the level of integration between datasets.

Item Type: Conference or Workshop Item (Paper)
ISBNs: 9783642325960 (print)
9783642325977 (electronic)
Related URLs:
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Physical Sciences and Engineering > Electronics and Computer Science > IT Innovation Centre
Faculty of Physical Sciences and Engineering > Electronics and Computer Science > Web & Internet Science
ePrint ID: 340166
Date Deposited: 15 Jun 2012 13:48
Last Modified: 14 Apr 2014 11:42
Research Funder: EPSRC
Projects:
EnAKTing the Unbounded Data Web: Challenges in Web Science
Funded by: EPSRC (EP/G008493/1)
Led by: Nigel Shadbolt
1 April 2009 to 30 September 2012
Further Information:Google Scholar
URI: http://eprints.soton.ac.uk/id/eprint/340166

Actions (login required)

View Item View Item