The requirements of using provenance in e-Science experiments

In e-Science experiments, it is vital to record the experimental process for later use, such as interpreting results, verifying that the correct process took place, or tracing where data came from. The process that led to some data is called the provenance of that data, and a provenance architecture is the software architecture for a system that provides the functionality needed to record, store and use process documentation to determine the provenance of data items. However, there has been little principled analysis of what is actually required of a provenance architecture, so it is impossible to determine the functionality such an architecture would ideally support. In this paper, we present use cases for a provenance architecture from current experiments in biology, chemistry, physics and computer science, and analyse the use cases to determine the technical requirements of a generic, technology- and application-independent architecture. We propose an architecture that meets these requirements, analyse its features compared with other approaches, and evaluate a preliminary implementation by attempting to realise two of the use cases.

provenance, e-Science requirements, use cases

Miles, Simon

Groth, Paul

Branco, Miguel

Moreau, Luc

2006

Miles, Simon, Groth, Paul, Branco, Miguel and Moreau, Luc (2006) The requirements of using provenance in e-Science experiments. Journal of Grid Computing.

Record type: Article

Text

pasoa04requirements.pdf - Accepted Manuscript

Download (198kB)

More information

Published date: 2006

Keywords: provenance, e-Science requirements, use cases

Organisations: Web & Internet Science

Identifiers

Local EPrints ID: 262566

URI: http://eprints.soton.ac.uk/id/eprint/262566

PURE UUID: f3504729-f0e8-46a4-ac52-13cd2c266acf

ORCID for Luc Moreau: orcid.org/0000-0002-3494-120X

Catalogue record

Date deposited: 12 May 2006

Last modified: 14 Mar 2024 07:13

Contributors

Author: Simon Miles

Author: Paul Groth

Author: Miguel Branco

Author: Luc Moreau
