Recording and Reasoning over Data Provenance in Web and Grid Services
Recording and Reasoning over Data Provenance in Web and Grid Services
Large-scale, dynamic and open environments such as the Grid and Web Services build upon existing computing infrastructures to supply dependable and consistent large-scale computational systems. This kind of architecture has been adopted by the business and scientific communities allowing them to exploit extensive and diverse computing resources to perform complex data processing tasks. In such systems, results are often derived by composing multiple, geographically distributed, heterogeneous services as specified by intricate workflow management. This leads to the undesirable situation where the results are known, but the means by which they were achieved is not. With both scientific experiments and business transactions, the notion of lineage and dataset derivation is of paramount importance since without it, information is potentially worthless. We address the issue of {\em data provenance\/}, the description of the origin of a piece of data, in these environments showing the requirements, uses and implementation difficulties. We propose an infrastructure level support for a provenance recording capability for service-oriented architectures such as the Grid and Web Services. We also offer services to view and retrieve provenance and we provide a mechanism by which provenance is used to determine whether previous computed results are still up to date.
3-540-20498-9
603-620
Szomszor, Martin
c797d2c4-7fd3-45f5-9aa6-474faf550786
Moreau, Luc
033c63dd-3fe9-4040-849f-dfccbe0406f8
2003
Szomszor, Martin
c797d2c4-7fd3-45f5-9aa6-474faf550786
Moreau, Luc
033c63dd-3fe9-4040-849f-dfccbe0406f8
Szomszor, Martin and Moreau, Luc
(2003)
Recording and Reasoning over Data Provenance in Web and Grid Services.
International Conference on Ontologies, Databases and Applications of SEmantics (ODBASE'03), Catania, Sicily, Italy.
.
(doi:10.1007/978-3-540-39964-3_39).
Record type:
Conference or Workshop Item
(Paper)
Abstract
Large-scale, dynamic and open environments such as the Grid and Web Services build upon existing computing infrastructures to supply dependable and consistent large-scale computational systems. This kind of architecture has been adopted by the business and scientific communities allowing them to exploit extensive and diverse computing resources to perform complex data processing tasks. In such systems, results are often derived by composing multiple, geographically distributed, heterogeneous services as specified by intricate workflow management. This leads to the undesirable situation where the results are known, but the means by which they were achieved is not. With both scientific experiments and business transactions, the notion of lineage and dataset derivation is of paramount importance since without it, information is potentially worthless. We address the issue of {\em data provenance\/}, the description of the origin of a piece of data, in these environments showing the requirements, uses and implementation difficulties. We propose an infrastructure level support for a provenance recording capability for service-oriented architectures such as the Grid and Web Services. We also offer services to view and retrieve provenance and we provide a mechanism by which provenance is used to determine whether previous computed results are still up to date.
Text
odbase03
- Accepted Manuscript
More information
Published date: 2003
Additional Information:
Event Dates: November
Venue - Dates:
International Conference on Ontologies, Databases and Applications of SEmantics (ODBASE'03), Catania, Sicily, Italy, 2003-11-01
Organisations:
Web & Internet Science
Identifiers
Local EPrints ID: 259450
URI: http://eprints.soton.ac.uk/id/eprint/259450
ISBN: 3-540-20498-9
PURE UUID: 9f5b527f-63f4-4457-9e71-9cfd9d90fc43
Catalogue record
Date deposited: 28 Jun 2004
Last modified: 14 Mar 2024 06:24
Export record
Altmetrics
Contributors
Author:
Martin Szomszor
Author:
Luc Moreau
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics