The University of Southampton
University of Southampton Institutional Repository

Towards repository preservation services. Final report from the JISC Preserv 2 project

Towards repository preservation services. Final report from the JISC Preserv 2 project
Towards repository preservation services. Final report from the JISC Preserv 2 project
Preserv 2 investigated the preservation of data in digital institutional repositories, focussing in particular on managing storage, data and file formats. Preserv 2 developed the first repository storage controller, which will be a feature of EPrints version 3.2 software (due 2009). Plugin applications that use the controller have been written for Amazon S3 and Sun cloud services among others, as well as for local disk storage. In a breakthrough application Preserv 2 used OAI-ORE to show how data can be moved between two repository softwares with quite distinct data models, from an EPrints repository to a Fedora repository. The largest area of work in Preserv 2 was on file format management and an 'active' preservation approach. This involves identifying file formats, assessing the risks posed by those formats and taking action to obviate the risks where that could be justified. These processes were implemented with reference to a technical registry, PRONOM from The National Archives (TNA), and DROID (digital record object identification service), also produced by TNA. Preserv 2 showed we can invoke a current registry to classify the digital objects and present a hierarchy of risk scores for a repository. Classification was performed using the Preserv2 EPrints preservation toolkit. This 'wraps' DROID in an EPrints repository environment. This toolkit will be another feature available for EPrints v3.2 software. The result of file format identification can indicate a file is at risk of becoming inaccessible or corrupted. Preserv 2 developed a repository interface to present formats by risk category. Providing risk scores through the live PRONOM service was shown to be feasible. Spin-off work is ongoing to develop format risk scores by compiling data from multiple sources in a new linked data registry.
digital repositories, digital preservation, EPrints, Preserv project
Hitchcock, Steve
c0b120a1-439e-43c9-9ba6-647e77f40f3c
Tarrant, David
4aec820b-6055-4f58-abeb-1cc901eb19f2
Carr, Les
0572b10e-039d-46c6-bf05-57cce71d3936
Hitchcock, Steve
c0b120a1-439e-43c9-9ba6-647e77f40f3c
Tarrant, David
4aec820b-6055-4f58-abeb-1cc901eb19f2
Carr, Les
0572b10e-039d-46c6-bf05-57cce71d3936

Hitchcock, Steve, Tarrant, David and Carr, Les (2009) Towards repository preservation services. Final report from the JISC Preserv 2 project (In Press)

Record type: Monograph (Project Report)

Abstract

Preserv 2 investigated the preservation of data in digital institutional repositories, focussing in particular on managing storage, data and file formats. Preserv 2 developed the first repository storage controller, which will be a feature of EPrints version 3.2 software (due 2009). Plugin applications that use the controller have been written for Amazon S3 and Sun cloud services among others, as well as for local disk storage. In a breakthrough application Preserv 2 used OAI-ORE to show how data can be moved between two repository softwares with quite distinct data models, from an EPrints repository to a Fedora repository. The largest area of work in Preserv 2 was on file format management and an 'active' preservation approach. This involves identifying file formats, assessing the risks posed by those formats and taking action to obviate the risks where that could be justified. These processes were implemented with reference to a technical registry, PRONOM from The National Archives (TNA), and DROID (digital record object identification service), also produced by TNA. Preserv 2 showed we can invoke a current registry to classify the digital objects and present a hierarchy of risk scores for a repository. Classification was performed using the Preserv2 EPrints preservation toolkit. This 'wraps' DROID in an EPrints repository environment. This toolkit will be another feature available for EPrints v3.2 software. The result of file format identification can indicate a file is at risk of becoming inaccessible or corrupted. Preserv 2 developed a repository interface to present formats by risk category. Providing risk scores through the live PRONOM service was shown to be feasible. Spin-off work is ongoing to develop format risk scores by compiling data from multiple sources in a new linked data registry.

Text
preserv2-finalreport.pdf - Other
Download (509kB)
Text
preserv2-finalreport.doc - Other
Download (665kB)

More information

Accepted/In Press date: 23 July 2009
Keywords: digital repositories, digital preservation, EPrints, Preserv project
Organisations: Web & Internet Science

Identifiers

Local EPrints ID: 268148
URI: http://eprints.soton.ac.uk/id/eprint/268148
PURE UUID: 738f6f62-d5db-46c8-9fc3-08d01f760385
ORCID for Les Carr: ORCID iD orcid.org/0000-0002-2113-9680

Catalogue record

Date deposited: 28 Oct 2009 15:18
Last modified: 15 Mar 2024 02:33

Export record

Contributors

Author: Steve Hitchcock
Author: David Tarrant
Author: Les Carr ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×