Recording process documentation in the presence of failures in service oriented architectures
Recording process documentation in the presence of failures in service oriented architectures
Scientific and engineering communities (e.g., chemistry, bioinformatics and engineering manufacturing) have presented unprecedented requirements for knowing the provenance of their data products, i.e., where they originated from, how they were produced and what has happened to them since creation. Without such important knowledge, scientists and engineers cannot reproduce, analyse or validate experiments and processes.
Previous work has conceived a computer-based representation of a past process for determining provenance, termed process documentation. However, current provenance systems do not adequately address the problem of reliably recording process documentation in large scale environments like Service Oriented Architectures. For example, a service may not be available and network connection may be broken. In this context, reliably recording process documentation becomes challenging, given that the documentation produced in a process can be spread over multiple provenance repositories across the world.
The presence of failures (specifically, the crash of provenance repositories and communication failures) may prevent process documentation from being recorded, losing the evidence that a process occurred. This would have disastrous consequences and hence is not acceptable in the domains that rely on process documentation to determine the provenance of their data products. In this thesis, we systematically analyse all situations that may occur during capturing process documentation in the event of assumed failures. We then present a novel coordinator-based protocol that is formally proved to record complete process documentation.
In addition, we use graphs to intuitively represent the topology of process documentation recorded in multiple interlinked provenance repositories, which helps us to investigate the entire retrievability of distributed process documentation. Finally, we evaluate a system architecture that employs the protocol and supports practical issues such as communication, storage and performance. The results show that the system can record complete and retrievable process documentation while maintaining acceptable performance.
Chen, Zheng
d07c1910-ab71-4efe-8c09-cd6adea14404
September 2009
Chen, Zheng
d07c1910-ab71-4efe-8c09-cd6adea14404
Moreau, Luc
033c63dd-3fe9-4040-849f-dfccbe0406f8
Chen, Zheng
(2009)
Recording process documentation in the presence of failures in service oriented architectures.
University of Southampton, School of Electronics and Computer Science, Doctoral Thesis, 246pp.
Record type:
Thesis
(Doctoral)
Abstract
Scientific and engineering communities (e.g., chemistry, bioinformatics and engineering manufacturing) have presented unprecedented requirements for knowing the provenance of their data products, i.e., where they originated from, how they were produced and what has happened to them since creation. Without such important knowledge, scientists and engineers cannot reproduce, analyse or validate experiments and processes.
Previous work has conceived a computer-based representation of a past process for determining provenance, termed process documentation. However, current provenance systems do not adequately address the problem of reliably recording process documentation in large scale environments like Service Oriented Architectures. For example, a service may not be available and network connection may be broken. In this context, reliably recording process documentation becomes challenging, given that the documentation produced in a process can be spread over multiple provenance repositories across the world.
The presence of failures (specifically, the crash of provenance repositories and communication failures) may prevent process documentation from being recorded, losing the evidence that a process occurred. This would have disastrous consequences and hence is not acceptable in the domains that rely on process documentation to determine the provenance of their data products. In this thesis, we systematically analyse all situations that may occur during capturing process documentation in the event of assumed failures. We then present a novel coordinator-based protocol that is formally proved to record complete process documentation.
In addition, we use graphs to intuitively represent the topology of process documentation recorded in multiple interlinked provenance repositories, which helps us to investigate the entire retrievability of distributed process documentation. Finally, we evaluate a system architecture that employs the protocol and supports practical issues such as communication, storage and performance. The results show that the system can record complete and retrievable process documentation while maintaining acceptable performance.
Text
Thesis_-_Zheng_Chen.pdf
- Other
More information
Published date: September 2009
Organisations:
University of Southampton
Identifiers
Local EPrints ID: 71951
URI: http://eprints.soton.ac.uk/id/eprint/71951
PURE UUID: b44bd92c-1956-4c80-ae2f-1cddc7092635
Catalogue record
Date deposited: 18 Jan 2010
Last modified: 13 Mar 2024 20:53
Export record
Contributors
Author:
Zheng Chen
Thesis advisor:
Luc Moreau
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics