Recording and Using Provenance in a Protein Compressibility Experiment
Groth, Paul, Miles, Simon, Fang, Weijan, Wong, Sylvia C., Zauner, Klaus-Peter and Moreau, Luc (2005) Recording and Using Provenance in a Protein Compressibility Experiment. In, The 14th IEEE International Symposium on High Performance Distributed Computing (HPDC-14), Research Triangle Park, North Carolina, 24 - 27 Jul 2005.
Download
|
PDF
Download (162Kb) |
Description/Abstract
Very large scale computations are now becoming routinely used as a methodology to undertake scientific research. In this context, ‘provenance systems’ are regarded as the equivalent of the scientist’s logbook for in silico experimentation: provenance captures the documentation of the process that led to some result. Using a protein compressibility analysis application, we derive a set of generic use cases for a provenance system. In order to support these, we address the following fundamental questions: what is provenance? how to record it? what is the performance impact for grid execution? what is the performance of reasoning? In doing so, we define a technologyindependent notion of provenance that captures interactions between components, internal component information and grouping of interactions, so as to allow us to analyse and reason about the execution of scientific processes. In order to support persistent provenance in heterogeneous applications, we introduce a separate provenance store, in which provenance documentation can be stored, archived and queried independently of the technology used to run the application. Through a series of practical tests, we evaluate the performance impact of such a provenance system. In summary, we demonstrate that provenance recording overhead of our prototype system remains under 10% of execution time, and we show that the recorded information successfully supports our use cases in a performant manner.
| Item Type: | Conference or Workshop Item (Paper) |
|---|---|
| Additional Information: | Event Dates: 24-27 July, 2005 |
| Keywords: | Provenance, Grid, protein compressibility |
| Divisions: | Faculty of Physical and Applied Science > Electronics and Computer Science > Web & Internet Science Faculty of Physical and Applied Science > Electronics and Computer Science > Agents, Interactions & Complexity |
| Item ID: | 260910 |
| Date Deposited: | 24 May 2005 |
| Last Modified: | 26 Apr 2013 03:25 |
| Contributors: | Groth, Paul (Author) Miles, Simon (Author) Fang, Weijan (Author) Wong, Sylvia C. (Author) Zauner, Klaus-Peter (Author) Moreau, Luc (Author) |
| Date: | 2005 |
| Additional Information: | Event Dates: 24-27 July, 2005 |
| Status: | Published |
| Further Information: | Google Scholar |
| ISI Citation Count: | 1 |
| URI: | http://eprints.soton.ac.uk/id/eprint/260910 |
Actions (login required)
![]() |
View Item |


