The University of Southampton
University of Southampton Institutional Repository

A Semantically enabled architecture for crowdsourced linked data management

A Semantically enabled architecture for crowdsourced linked data management
A Semantically enabled architecture for crowdsourced linked data management
Increasing amounts of structured data are exposed on the Web using graph-based representation models and protocols such as RDF and SPARQL. Nevertheless, while the overall volume of such open, or easily accessible, data sources reaches critical mass, the ability of potential consumers to use them in novel applications and services is predicated on the availability of purposeful means to query and manage the data, while taking into account and mastering its essential features in terms of decentralization, heterogeneity of schema, varying quality, and scale. Many aspects of these challenges are necessarily tackled through a combination of algorithmic techniques and manual effort. In the literature on traditional data management the theoretical and technical groundwork to realize and manage such combinations is being established. In this paper we build upon these ideas and propose a semantically enabled architecture for crowdsourced data management systems which uses formal representations of tasks and data to automatically design and optimize the operation and outcomes of human computation projects. The architecture is applied to the context of Linked Data management to address specific challenges of LinkedData query processing such as identity resolution and ontological classification. Starting from a motivational scenario we explain how query-processing tasks can be decomposed and translated into MTurk projects using our semantic approach, and roadmap the extensions to graph-based data management technology that are required to achieve this vision.
9-14
Simperl, E.
40261ae4-c58c-48e4-b78b-5187b10e4f67
Acosta, M.
90523232-6ad8-4e33-aac9-1870a7453489
Norton, B.
24e57777-0135-412a-b93f-d24fca273091
Simperl, E.
40261ae4-c58c-48e4-b78b-5187b10e4f67
Acosta, M.
90523232-6ad8-4e33-aac9-1870a7453489
Norton, B.
24e57777-0135-412a-b93f-d24fca273091

Simperl, E., Acosta, M. and Norton, B. (2012) A Semantically enabled architecture for crowdsourced linked data management. First International Workshop on Crowdsourcing Web Search (CrowdSearch 2012), Lyon, France. 16 Apr 2012. pp. 9-14 .

Record type: Conference or Workshop Item (Paper)

Abstract

Increasing amounts of structured data are exposed on the Web using graph-based representation models and protocols such as RDF and SPARQL. Nevertheless, while the overall volume of such open, or easily accessible, data sources reaches critical mass, the ability of potential consumers to use them in novel applications and services is predicated on the availability of purposeful means to query and manage the data, while taking into account and mastering its essential features in terms of decentralization, heterogeneity of schema, varying quality, and scale. Many aspects of these challenges are necessarily tackled through a combination of algorithmic techniques and manual effort. In the literature on traditional data management the theoretical and technical groundwork to realize and manage such combinations is being established. In this paper we build upon these ideas and propose a semantically enabled architecture for crowdsourced data management systems which uses formal representations of tasks and data to automatically design and optimize the operation and outcomes of human computation projects. The architecture is applied to the context of Linked Data management to address specific challenges of LinkedData query processing such as identity resolution and ontological classification. Starting from a motivational scenario we explain how query-processing tasks can be decomposed and translated into MTurk projects using our semantic approach, and roadmap the extensions to graph-based data management technology that are required to achieve this vision.

This record has no associated files available for download.

More information

Published date: 17 April 2012
Venue - Dates: First International Workshop on Crowdsourcing Web Search (CrowdSearch 2012), Lyon, France, 2012-04-16 - 2012-04-16
Organisations: Web & Internet Science

Identifiers

Local EPrints ID: 351664
URI: http://eprints.soton.ac.uk/id/eprint/351664
PURE UUID: 77caf87c-0ff7-46e7-a755-2699befd3c30
ORCID for E. Simperl: ORCID iD orcid.org/0000-0003-1722-947X

Catalogue record

Date deposited: 29 Apr 2013 15:41
Last modified: 09 Jan 2022 03:42

Export record

Contributors

Author: E. Simperl ORCID iD
Author: M. Acosta
Author: B. Norton

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×