The University of Southampton
University of Southampton Institutional Repository

Recording and Reasoning over Data Provenance in Web and Grid Services

Recording and Reasoning over Data Provenance in Web and Grid Services
Recording and Reasoning over Data Provenance in Web and Grid Services
Large-scale, dynamic and open environments such as the Grid and Web Services build upon existing computing infrastructures to supply dependable and consistent large-scale computational systems. This kind of architecture has been adopted by the business and scientific communities allowing them to exploit extensive and diverse computing resources to perform complex data processing tasks. In such systems, results are often derived by composing multiple, geographically distributed, heterogeneous services as specified by intricate workflow management. This leads to the undesirable situation where the results are known, but the means by which they were achieved is not. With both scientific experiments and business transactions, the notion of lineage and dataset derivation is of paramount importance since without it, information is potentially worthless. We address the issue of {\em data provenance\/}, the description of the origin of a piece of data, in these environments showing the requirements, uses and implementation difficulties. We propose an infrastructure level support for a provenance recording capability for service-oriented architectures such as the Grid and Web Services. We also offer services to view and retrieve provenance and we provide a mechanism by which provenance is used to determine whether previous computed results are still up to date.
3-540-20498-9
603-620
Szomszor, Martin
c797d2c4-7fd3-45f5-9aa6-474faf550786
Moreau, Luc
033c63dd-3fe9-4040-849f-dfccbe0406f8
Szomszor, Martin
c797d2c4-7fd3-45f5-9aa6-474faf550786
Moreau, Luc
033c63dd-3fe9-4040-849f-dfccbe0406f8

Szomszor, Martin and Moreau, Luc (2003) Recording and Reasoning over Data Provenance in Web and Grid Services. International Conference on Ontologies, Databases and Applications of SEmantics (ODBASE'03), Catania, Sicily, Italy. pp. 603-620 . (doi:10.1007/978-3-540-39964-3_39).

Record type: Conference or Workshop Item (Paper)

Abstract

Large-scale, dynamic and open environments such as the Grid and Web Services build upon existing computing infrastructures to supply dependable and consistent large-scale computational systems. This kind of architecture has been adopted by the business and scientific communities allowing them to exploit extensive and diverse computing resources to perform complex data processing tasks. In such systems, results are often derived by composing multiple, geographically distributed, heterogeneous services as specified by intricate workflow management. This leads to the undesirable situation where the results are known, but the means by which they were achieved is not. With both scientific experiments and business transactions, the notion of lineage and dataset derivation is of paramount importance since without it, information is potentially worthless. We address the issue of {\em data provenance\/}, the description of the origin of a piece of data, in these environments showing the requirements, uses and implementation difficulties. We propose an infrastructure level support for a provenance recording capability for service-oriented architectures such as the Grid and Web Services. We also offer services to view and retrieve provenance and we provide a mechanism by which provenance is used to determine whether previous computed results are still up to date.

Text
odbase03 - Accepted Manuscript
Download (13MB)

More information

Published date: 2003
Additional Information: Event Dates: November
Venue - Dates: International Conference on Ontologies, Databases and Applications of SEmantics (ODBASE'03), Catania, Sicily, Italy, 2003-11-01
Organisations: Web & Internet Science

Identifiers

Local EPrints ID: 259450
URI: http://eprints.soton.ac.uk/id/eprint/259450
ISBN: 3-540-20498-9
PURE UUID: 9f5b527f-63f4-4457-9e71-9cfd9d90fc43
ORCID for Luc Moreau: ORCID iD orcid.org/0000-0002-3494-120X

Catalogue record

Date deposited: 28 Jun 2004
Last modified: 14 Mar 2024 06:24

Export record

Altmetrics

Contributors

Author: Martin Szomszor
Author: Luc Moreau ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×