The University of Southampton
University of Southampton Institutional Repository

Statistical analysis of the owl:sameAs network for aligning concepts in the linking open data cloud

Statistical analysis of the owl:sameAs network for aligning concepts in the linking open data cloud
Statistical analysis of the owl:sameAs network for aligning concepts in the linking open data cloud
The massively distributed publication of linked data has brought to the attention of scientific community the limitations of classic methods for achieving data integration and the opportunities of pushing the boundaries of the field by experimenting this collective enterprise that is the linking open data cloud. While reusing existing ontologies is the choice of preference, the exploitation of ontology alignments still is a required step for easing the burden of integrating heterogeneous data sets. Alignments, even between the most used vocabularies, is still poorly supported in systems nowadays whereas links between instances are the most widely used means for bridging the gap between different data sets. We provide in this paper an account of our statistical and qualitative analysis of the network of instance level equivalences in the Linking Open Data Cloud (i.e. the sameAs network) in order to automatically compute alignments at the conceptual level. Moreover, we explore the effect of ontological information when adopting classical Jaccard methods to the ontology alignment task. Automating such task will allow in fact to achieve a clearer conceptual description of the data at the cloud level, while improving the level of integration between datasets.
978-3-642-32596-0
Correndo, Gianluca
fea0843a-6d4a-4136-8784-0d023fcde3e2
Penta, Antonio
dd594010-25ac-4126-875c-20af78040c45
Gibbins, Nicholas
98efd447-4aa7-411c-86d1-955a612eceac
Shadbolt, Nigel
5c5acdf4-ad42-49b6-81fe-e9db58c2caf7
Correndo, Gianluca
fea0843a-6d4a-4136-8784-0d023fcde3e2
Penta, Antonio
dd594010-25ac-4126-875c-20af78040c45
Gibbins, Nicholas
98efd447-4aa7-411c-86d1-955a612eceac
Shadbolt, Nigel
5c5acdf4-ad42-49b6-81fe-e9db58c2caf7

Correndo, Gianluca, Penta, Antonio, Gibbins, Nicholas and Shadbolt, Nigel (2012) Statistical analysis of the owl:sameAs network for aligning concepts in the linking open data cloud. International Conference on Database and Expert Systems Applications (DEXA 2012 ), Wien, Austria. 03 - 07 Sep 2012. 15 pp . (doi:10.1007/978-3-642-32597-7_20).

Record type: Conference or Workshop Item (Paper)

Abstract

The massively distributed publication of linked data has brought to the attention of scientific community the limitations of classic methods for achieving data integration and the opportunities of pushing the boundaries of the field by experimenting this collective enterprise that is the linking open data cloud. While reusing existing ontologies is the choice of preference, the exploitation of ontology alignments still is a required step for easing the burden of integrating heterogeneous data sets. Alignments, even between the most used vocabularies, is still poorly supported in systems nowadays whereas links between instances are the most widely used means for bridging the gap between different data sets. We provide in this paper an account of our statistical and qualitative analysis of the network of instance level equivalences in the Linking Open Data Cloud (i.e. the sameAs network) in order to automatically compute alignments at the conceptual level. Moreover, we explore the effect of ontological information when adopting classical Jaccard methods to the ontology alignment task. Automating such task will allow in fact to achieve a clearer conceptual description of the data at the cloud level, while improving the level of integration between datasets.

Text
correndo_dexa_2012.pdf - Author's Original
Download (583kB)

More information

Published date: 5 September 2012
Venue - Dates: International Conference on Database and Expert Systems Applications (DEXA 2012 ), Wien, Austria, 2012-09-03 - 2012-09-07
Related URLs:
Organisations: Web & Internet Science, IT Innovation

Identifiers

Local EPrints ID: 340166
URI: http://eprints.soton.ac.uk/id/eprint/340166
ISBN: 978-3-642-32596-0
PURE UUID: 2bb5c167-365c-4637-b5eb-eb9bd8314456
ORCID for Gianluca Correndo: ORCID iD orcid.org/0000-0003-3335-5759
ORCID for Nicholas Gibbins: ORCID iD orcid.org/0000-0002-6140-9956

Catalogue record

Date deposited: 15 Jun 2012 13:48
Last modified: 15 Mar 2024 03:00

Export record

Altmetrics

Contributors

Author: Gianluca Correndo ORCID iD
Author: Antonio Penta
Author: Nicholas Gibbins ORCID iD
Author: Nigel Shadbolt

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×