A notation and system for expressing and executing cleanly typed workflows on messy scientific data
A notation and system for expressing and executing cleanly typed workflows on messy scientific data
The description, composition, and execution of even logically simple scientific workflows are often complicated by the need to deal with "messy'' issues like heterogeneous storage formats and ad-hoc file system structures. We show how these difficulties can be overcome via a typed, compositional workflow notation within which issues of physical representation are cleanly separated from logical typing, and by the implementation of this notation within the context of a powerful runtime system that supports distributed execution. The resulting notation and system are capable both of expressing complex workflows in a simple, compact form, and of enacting those workflows in distributed environments. We apply our technique to cognitive neuroscience workflows that analyze functional MRI image data, and demonstrate significant reductions in code size relative to other approaches.
37-43
Zhao, Yong
1f64ad89-3841-480a-96dc-b472f57eaebc
Dobson, Jed
36221a4c-59d6-411a-8b47-02a33064b1de
Foster, Ian
d2bc443a-7925-4d99-88cb-8877d6ae9cf0
Moreau, Luc
033c63dd-3fe9-4040-849f-dfccbe0406f8
Wilde, Michael
51f90194-5bb6-4820-a65c-34c922d8a5ff
1 September 2005
Zhao, Yong
1f64ad89-3841-480a-96dc-b472f57eaebc
Dobson, Jed
36221a4c-59d6-411a-8b47-02a33064b1de
Foster, Ian
d2bc443a-7925-4d99-88cb-8877d6ae9cf0
Moreau, Luc
033c63dd-3fe9-4040-849f-dfccbe0406f8
Wilde, Michael
51f90194-5bb6-4820-a65c-34c922d8a5ff
Zhao, Yong, Dobson, Jed, Foster, Ian, Moreau, Luc and Wilde, Michael
(2005)
A notation and system for expressing and executing cleanly typed workflows on messy scientific data.
SIGMOD Record, 34 (3), .
(doi:10.1145/1084805.1084813).
Abstract
The description, composition, and execution of even logically simple scientific workflows are often complicated by the need to deal with "messy'' issues like heterogeneous storage formats and ad-hoc file system structures. We show how these difficulties can be overcome via a typed, compositional workflow notation within which issues of physical representation are cleanly separated from logical typing, and by the implementation of this notation within the context of a powerful runtime system that supports distributed execution. The resulting notation and system are capable both of expressing complex workflows in a simple, compact form, and of enacting those workflows in distributed environments. We apply our technique to cognitive neuroscience workflows that analyze functional MRI image data, and demonstrate significant reductions in code size relative to other approaches.
Text
sigmod05
- Accepted Manuscript
More information
Published date: 1 September 2005
Organisations:
IAM
Identifiers
Local EPrints ID: 409358
URI: http://eprints.soton.ac.uk/id/eprint/409358
ISSN: 0163-5808
PURE UUID: 70422922-cf2e-428a-94d0-ffa4d0f7cdfb
Catalogue record
Date deposited: 28 May 2017 04:08
Last modified: 15 Mar 2024 12:42
Export record
Altmetrics
Contributors
Author:
Yong Zhao
Author:
Jed Dobson
Author:
Ian Foster
Author:
Luc Moreau
Author:
Michael Wilde
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics