University of Southampton Institutional Repository

Applying the Provenance Data Model to a Bioinformatics Case

Pages: 250-264
Publisher: IOS Press
Authors: Paul Groth, Steve Munroe, Simon Miles, Luc Moreau
Editor: Lucio Grandinetti

Groth, Paul, Munroe, Steve, Miles, Simon and Moreau, Luc (2008) Applying the Provenance Data Model to a Bioinformatics Case. In, Grandinetti, Lucio (ed.) High Performance Computing and Grids in Action. (Advances in Parallel Computing, 16) IOS Press, pp. 250-264.

Record type: Book Section

Abstract

Scientists and, more generally, end users of computer systems need to be able to trust the data they use. Understanding the origin, or provenance, of data can provide this trust. Attempts have been made to develop systems for recording provenance; however, most are not generic and cannot be applied across different systems and technologies. Moreover, many existing systems confuse the concept of provenance with its representation. In this article, we discuss an open, technology-neutral model for provenance. The model can be applied across many different systems and makes the important distinction between provenance and the way it can be generated from a concrete representation of process. The model is described and applied to an example grid-based bioinformatics application.

Text
hpc08 - Accepted Manuscript
Download (704kB)

More information

Published date: 1 January 2008
Organisations: IAM, Electronics & Computer Science

Identifiers

Local EPrints ID: 409277
URI: http://eprints.soton.ac.uk/id/eprint/409277
PURE UUID: 861449fd-61cd-45c3-89be-f503951adc19
ORCID for Luc Moreau: orcid.org/0000-0002-3494-120X

Catalogue record

Date deposited: 28 May 2017 04:07
Last modified: 15 Mar 2024 12:49

Contributors

Author: Paul Groth
Author: Steve Munroe
Author: Simon Miles
Author: Luc Moreau
Editor: Lucio Grandinetti
