The University of Southampton
University of Southampton Institutional Repository

LinksB2N: Automatic Data Integration for the Semantic Web

Record type: Conference or Workshop Item (Paper)

The ongoing trend towards open data embraced by the Semantic Web has started to produce a large number of data sources. These data sources are published using RDF vocabularies, and it is possible to navigate throughout the data due to their graph topology. This paper presents LinksB2N, an algorithm for discovering information overlaps in RDF data repositories and performing data integration with no human intervention over data sets that partially share the same domain. LinksB2N identifies equivalent RDF resources from different data sets with several degrees of confidence. The algorithm relies on a novel approach that uses clustering techniques to analyze the distribution of unique objects that contain overlapping information in different data graphs. Our contribution is illustrated in the context of the Market Blended Insight project by applying the LinksB2N algorithm to data sets in the order of hundreds of millions of RDF triples containing relevant information in the domain of business to business (B2B) marketing analysis.

PDF Salvadores-ODBASE-2009.pdf - Version of Record
Download (579kB)
PDF Salvadores_LinksB2N_2009.pdf - Other
Download (1MB)

Citation

Salvadores, Manuel, Correndo, Gianluca, Rodriguez-Castro, Benedicto, Gibbins, Nicholas, Darlington, John and Shadbolt, Nigel (2009) LinksB2N: Automatic Data Integration for the Semantic Web At International Conference on Ontologies, DataBases, and Applications of Semantics (ODBASE 2009).

More information

Published date: 9 June 2009
Venue - Dates: International Conference on Ontologies, DataBases, and Applications of Semantics (ODBASE 2009), 2009-06-09
Keywords: Data Integration, Semantic Web.
Organisations: Web & Internet Science

Identifiers

Local EPrints ID: 267861
URI: http://eprints.soton.ac.uk/id/eprint/267861
PURE UUID: 601f9d45-71c4-4484-81cb-f114620a6105
ORCID for Nicholas Gibbins: ORCID iD orcid.org/0000-0002-6140-9956

Catalogue record

Date deposited: 13 Sep 2009 11:58
Last modified: 18 Jul 2017 06:59

Export record

Contributors

Author: Manuel Salvadores
Author: Gianluca Correndo
Author: Benedicto Rodriguez-Castro
Author: Nicholas Gibbins ORCID iD
Author: John Darlington
Author: Nigel Shadbolt

University divisions


Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×