The Foundations of the Open Provenance Model
The Foundations of the Open Provenance Model
The Open Provenance Model (OPM) is a community-driven data model for Provenance that is designed to support inter-operability of provenance technology. Underpinning OPM, is a notion of directed acyclic graph, used to represent data products and processes involved in past computations, and causal dependencies between these. The Open Provenance Model was derived following two "Provenance Challenges", international, multi-disciplinary activities trying to investigate how to exchange information between multiple systems supporting provenance and how to query it. The OPM design was mostly driven by practical and pragmatic considerations, and is being tested in a third Provenance Challenge, which has just started. The purpose of this paper is to investigate the theoretical foundations of this data model. The formalisation consists of a set-theoretic definition of the data model, a definition of the inferences by transitive closure that are permitted, a formal description of how the model can be used to express dependencies in past computations, and finally, a description of the kind of time-based inferences that are supported. A novel element that OPM introduces is the concept of an account, by which multiple descriptions of a same execution are allowed to co-exist in a same graph. Our formalisation gives a precise meaning to such accounts and associated notions of alternate and refinement.
Moreau, Luc
033c63dd-3fe9-4040-849f-dfccbe0406f8
Kwasnikowska, Natalia
bd61e3ba-63c8-473e-9791-99ece51ac98d
Van den Bussche, Jan
36be6c66-7250-4a26-8966-f186b289d156
Moreau, Luc
033c63dd-3fe9-4040-849f-dfccbe0406f8
Kwasnikowska, Natalia
bd61e3ba-63c8-473e-9791-99ece51ac98d
Van den Bussche, Jan
36be6c66-7250-4a26-8966-f186b289d156
Moreau, Luc, Kwasnikowska, Natalia and Van den Bussche, Jan
(2009)
The Foundations of the Open Provenance Model
(In Press)
Record type:
Monograph
(Project Report)
Abstract
The Open Provenance Model (OPM) is a community-driven data model for Provenance that is designed to support inter-operability of provenance technology. Underpinning OPM, is a notion of directed acyclic graph, used to represent data products and processes involved in past computations, and causal dependencies between these. The Open Provenance Model was derived following two "Provenance Challenges", international, multi-disciplinary activities trying to investigate how to exchange information between multiple systems supporting provenance and how to query it. The OPM design was mostly driven by practical and pragmatic considerations, and is being tested in a third Provenance Challenge, which has just started. The purpose of this paper is to investigate the theoretical foundations of this data model. The formalisation consists of a set-theoretic definition of the data model, a definition of the inferences by transitive closure that are permitted, a formal description of how the model can be used to express dependencies in past computations, and finally, a description of the kind of time-based inferences that are supported. A novel element that OPM introduces is the concept of an account, by which multiple descriptions of a same execution are allowed to co-exist in a same graph. Our formalisation gives a precise meaning to such accounts and associated notions of alternate and refinement.
More information
Accepted/In Press date: 20 April 2009
Organisations:
Web & Internet Science
Identifiers
Local EPrints ID: 267282
URI: http://eprints.soton.ac.uk/id/eprint/267282
PURE UUID: ba0d189b-16cd-425e-aaa5-e8777590d5e9
Catalogue record
Date deposited: 20 Apr 2009 15:21
Last modified: 20 Feb 2024 18:06
Export record
Contributors
Author:
Luc Moreau
Author:
Natalia Kwasnikowska
Author:
Jan Van den Bussche
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics