The University of Southampton
University of Southampton Institutional Repository

Documentation and visualisation of workflows for effective communication, collaboration and publication @ source

Documentation and visualisation of workflows for effective communication, collaboration and publication @ source
Documentation and visualisation of workflows for effective communication, collaboration and publication @ source
Workflows processing data from research activities and driving in silico experiments are becoming an increasingly important method for conducting scientific research. Workflows have the advantage that not only can they be automated and used to process data repeatedly, but they can also be reused – in part or whole – enabling them to be evolved for use in new experiments. A number of studies have investigated strategies for storing and sharing workflows for the benefit of reuse. These have revealed that simply storing workflows in
repositories without additional context does not enable workflows to be successfully reused. These studies have investigated what additional resources are needed to facilitate users of workflows and in particular to add provenance traces and to make workflows and their resources machine-readable. These additions also include adding metadata for curation, annotations for comprehension, and including data sets to provide additional context to the
workflow. Ultimately though, these mechanisms still rely on researchers having access to the software to view and run the workflows. We argue that there are situations where researchers may want to understand a workflow that goes beyond what provenance traces provide and without having to run the workflow directly; there are many situations in which it can be difficult or impossible to run the original workflow. To that end, we have investigated the creation of an interactive workflow visualization that captures the flow chart element of the
workflow with additional context including annotations, descriptions, parameters, metadata and input, intermediate, and results data that can be added to the record of a workflow experiment to enhance both curation and add value to enable reuse. We have created interactive workflow visualisations for the popular workflow creation tool KNIME, which does not provide users with an in-built function to extract provenance information that can otherwise only be viewed through the tool itself. Making use of the strengths of KNIME for
adding documentation and user-defined metadata we can extract and create a visualisation and curation package that encourages and enhances curation@source, facilitating effective communication, collaboration, and reuse of workflows.
72-87
Willoughby, Cerys
118d1e49-2c54-4f4d-bd49-fe3a192df9d7
Frey, Jeremy G.
ba60c559-c4af-44f1-87e6-ce69819bf23f
Willoughby, Cerys
118d1e49-2c54-4f4d-bd49-fe3a192df9d7
Frey, Jeremy G.
ba60c559-c4af-44f1-87e6-ce69819bf23f

Willoughby, Cerys and Frey, Jeremy G. (2017) Documentation and visualisation of workflows for effective communication, collaboration and publication @ source. International Journal of Digital Curation, 12 (1), 72-87. (doi:10.2218/ijdc.v12i1.532).

Record type: Article

Abstract

Workflows processing data from research activities and driving in silico experiments are becoming an increasingly important method for conducting scientific research. Workflows have the advantage that not only can they be automated and used to process data repeatedly, but they can also be reused – in part or whole – enabling them to be evolved for use in new experiments. A number of studies have investigated strategies for storing and sharing workflows for the benefit of reuse. These have revealed that simply storing workflows in
repositories without additional context does not enable workflows to be successfully reused. These studies have investigated what additional resources are needed to facilitate users of workflows and in particular to add provenance traces and to make workflows and their resources machine-readable. These additions also include adding metadata for curation, annotations for comprehension, and including data sets to provide additional context to the
workflow. Ultimately though, these mechanisms still rely on researchers having access to the software to view and run the workflows. We argue that there are situations where researchers may want to understand a workflow that goes beyond what provenance traces provide and without having to run the workflow directly; there are many situations in which it can be difficult or impossible to run the original workflow. To that end, we have investigated the creation of an interactive workflow visualization that captures the flow chart element of the
workflow with additional context including annotations, descriptions, parameters, metadata and input, intermediate, and results data that can be added to the record of a workflow experiment to enhance both curation and add value to enable reuse. We have created interactive workflow visualisations for the popular workflow creation tool KNIME, which does not provide users with an in-built function to extract provenance information that can otherwise only be viewed through the tool itself. Making use of the strengths of KNIME for
adding documentation and user-defined metadata we can extract and create a visualisation and curation package that encourages and enhances curation@source, facilitating effective communication, collaboration, and reuse of workflows.

Text
532-2053-1-PB - Version of Record
Download (887kB)

More information

Accepted/In Press date: 23 February 2017
e-pub ahead of print date: 16 September 2017
Published date: 16 September 2017

Identifiers

Local EPrints ID: 414450
URI: http://eprints.soton.ac.uk/id/eprint/414450
PURE UUID: 5fcd906a-8780-4d49-96c6-9cfa1575d314
ORCID for Cerys Willoughby: ORCID iD orcid.org/0000-0003-1721-9212
ORCID for Jeremy G. Frey: ORCID iD orcid.org/0000-0003-0842-4302

Catalogue record

Date deposited: 29 Sep 2017 16:31
Last modified: 10 Apr 2024 01:45

Export record

Altmetrics

Contributors

Author: Cerys Willoughby ORCID iD
Author: Jeremy G. Frey ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×