The requirements of recording and using provenance in e-Science experiments
The requirements of recording and using provenance in e-Science experiments
In e-Science experiments, it is vital to record the experimental process for later use such as in interpreting results, verifying that the correct process took place or tracing where data came from. The process that led to some data is called the provenance of that data, and a provenance architecture is the software architecture for a system that will provide the necessary functionality to record, store and use process documentation to determine the provenance of data items. However, there has been little principled analysis of what is actually required of a provenance architecture, so it is impossible to determine the functionality they would ideally support. In this paper, we present use cases for a provenance architecture from current experiments in biology, chemistry, physics and computer science, and analyse the use cases to determine the technical requirements of a generic, application-independent architecture. We propose an architecture that meets these requirements and evaluate a preliminary implementation by attempting to realise two of the use cases.
provenance, e-Science, requirements, use cases, Web Services, service-oriented architectures, Grid
Miles, Simon
76c81b8e-1ca1-4d6d-ace3-922f03df97e0
Groth, Paul
427b9eca-c4dd-45c1-be04-3c91bb327345
Branco, Miguel
1e18033c-f6cd-4b54-ae65-a14dc62735f8
Moreau, Luc
033c63dd-3fe9-4040-849f-dfccbe0406f8
2005
Miles, Simon
76c81b8e-1ca1-4d6d-ace3-922f03df97e0
Groth, Paul
427b9eca-c4dd-45c1-be04-3c91bb327345
Branco, Miguel
1e18033c-f6cd-4b54-ae65-a14dc62735f8
Moreau, Luc
033c63dd-3fe9-4040-849f-dfccbe0406f8
Miles, Simon, Groth, Paul, Branco, Miguel and Moreau, Luc
(2005)
The requirements of recording and using provenance in e-Science experiments
Record type:
Monograph
(Project Report)
Abstract
In e-Science experiments, it is vital to record the experimental process for later use such as in interpreting results, verifying that the correct process took place or tracing where data came from. The process that led to some data is called the provenance of that data, and a provenance architecture is the software architecture for a system that will provide the necessary functionality to record, store and use process documentation to determine the provenance of data items. However, there has been little principled analysis of what is actually required of a provenance architecture, so it is impossible to determine the functionality they would ideally support. In this paper, we present use cases for a provenance architecture from current experiments in biology, chemistry, physics and computer science, and analyse the use cases to determine the technical requirements of a generic, application-independent architecture. We propose an architecture that meets these requirements and evaluate a preliminary implementation by attempting to realise two of the use cases.
Text
pasoa04requirements.pdf
- Author's Original
Text
pasoa05requirements.pdf
- Accepted Manuscript
More information
Published date: 2005
Keywords:
provenance, e-Science, requirements, use cases, Web Services, service-oriented architectures, Grid
Organisations:
Web & Internet Science
Identifiers
Local EPrints ID: 260269
URI: http://eprints.soton.ac.uk/id/eprint/260269
PURE UUID: 8b4fd4b1-3438-4070-ad2c-5eff76f9663d
Catalogue record
Date deposited: 13 Jan 2005
Last modified: 14 Mar 2024 06:35
Export record
Contributors
Author:
Simon Miles
Author:
Paul Groth
Author:
Miguel Branco
Author:
Luc Moreau
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics