A model of process documentation to determine provenance in mash-ups

Groth, Paul, Miles, Simon and Moreau, Luc (2009) A model of process documentation to determine provenance in mash-ups. ACM Transactions on Internet Technology (TOIT), 9, (1), 3:1-3:31. (doi:10.1145/1462159.1462162).


Through technologies such as RSS (Really Simple Syndication), Web Services, and AJAX (Asynchronous JavaScript And XML), the Internet has facilitated the emergence of applications that are composed from a variety of services and data sources. Through tools such as Yahoo Pipes, these "mash-ups" can be composed in a dynamic, just-in-time manner from components provided by multiple institutions (e.g. Google, Amazon, your neighbour). However, when using these applications, it is not apparent where data comes from or how it is processed. Thus, to inspire trust and confidence in mash-ups, it is critical to be able to analyse their processes after the fact. These trailing analyses, in particular the determination of the provenance of a result (i.e. the process that led to it), are enabled by process documentation, which is documentation of an application's past process created by the components of that application at execution time. In this paper, we define a generic conceptual data model that supports the autonomous creation of attributable, factual process documentation for dynamic multi-institutional applications. The data model is instantiated using two Internet formats, OWL and XML, and is evaluated with respect to questions about the provenance of results generated by a complex bioinformatics mash-up.

Item Type: Article
Digital Object Identifier (DOI): doi:10.1145/1462159.1462162
ISSNs: 1533-5399 (print)
Keywords: process, process documentation, provenance, data model, concept maps, mash-ups
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Physical Sciences and Engineering > Electronics and Computer Science > Web & Internet Science
ePrint ID: 270861
Accepted Date and Publication Date: February 2009 (Published)
Date Deposited: 20 Apr 2010 22:07
Last Modified: 31 Mar 2016 14:17
Projects: PASOA: Provenance Aware Service Oriented Architecture (funded by EPSRC, GR/S67623/01; 1 February 2004 to 30 June 2007)
URI: http://eprints.soton.ac.uk/id/eprint/270861
