The University of Southampton
University of Southampton Institutional Repository

A model of process documentation to determine provenance in mash-ups

Groth, Paul, Miles, Simon and Moreau, Luc (2009) A model of process documentation to determine provenance in mash-ups ACM Transactions on Internet Technology, 9, (1), 3:1-3:31.

Record type: Article


Through technologies such as RSS (Really Simple Syndication), Web Services, and AJAX (Asynchronous JavaScript And XML), the Internet has facilitated the emergence of applications that are composed from a variety of services and data sources. Through tools such as Yahoo Pipes, these "mash-ups" can be composed in a dynamic, just-in-time manner from components provided by multiple institutions (i.e. Google, Amazon, your neighbour). However, when using these applications, it is not apparent where data comes from or how it is processed. Thus, to inspire trust and confidence in mash-ups, it is critical to be able to analyse their processes after the fact. These trailing analyses, in particular the determination of the provenance of a result (i.e. the process that led to it), are enabled by process documentation, which is documentation of an application's past process created by the components of that application at execution time. In this paper, we define a generic conceptual data model that supports the autonomous creation of attributable, factual process documentation for dynamic multi-institutional applications. The data model is instantiated using two Internet formats, OWL and XML, and is evaluated with respect to questions about the provenance of results generated by a complex bioinformatics mash-up.

PDF toit09.pdf - Version of Record
Download (908kB)

More information

Published date: February 2009
Keywords: process, process documentation, provenance, data model, concept maps, mash-ups
Organisations: Web & Internet Science


Local EPrints ID: 270861
ISSN: 1533-5399
PURE UUID: 36eaaaa8-4bbd-444d-be48-8be28a45e3a2
ORCID for Luc Moreau: ORCID iD

Catalogue record

Date deposited: 20 Apr 2010 22:07
Last modified: 22 Oct 2017 02:13

Export record


Author: Paul Groth
Author: Simon Miles
Author: Luc Moreau ORCID iD

University divisions

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton:

ePrints Soton supports OAI 2.0 with a base URL of

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.