The University of Southampton
University of Southampton Institutional Repository

A repository based framework for capture, management, curation and dissemination of research data

A repository based framework for capture, management, curation and dissemination of research data
A repository based framework for capture, management, curation and dissemination of research data
Based on the e-Bank-UK (http://www.ukoln.ac.uk/projects/ebank-uk) and Repository for the Laboratory, R4L (http://r4l.eprints.org) projects, a working model for a scientific data capture, management, curation and dissemination framework will be presented. The eCrystals repository has been constructed on an institutional repository platform and has been configured to ingest small molecule crystallographic data generated by the UK National Crystallography Service, whilst the R4L repository supports a range of different types of analytical chemistry data. This model addresses the current escalating ‘data deluge’ problem through integration of digital libraries technologies with both the research laboratory and also with established publication and dissemination routes. The institutional model provides a potential mechanism for the long term archival and availability of information in a manner that enables the capture of its research data output through integration into the laboratory environment. The repository ingest process ensures full capture of laboratory data and effective metadata creation at the point it is generated. A private archive provides effective management of the data, whilst an embargo procedure allows dissemination of results through a public archive in a timely manner. A schema for the dissemination of crystallographic data has been devised through consultation with the community which enables effective harvesting by data centres and third party aggregator services. The use of persistent identifiers provides a mechanism to permanently link the conventional scholarly article with its associated underlying dataset. Current work is investigating the issues associated with the construction of a federation of data repositories (institutional and subject based) operating on different software platforms and its long term integration into the publishing and chemical information provision processes.
Coles, Simon J.
3116f58b-c30c-48cf-bdd5-397d1c1fecf8
Coles, Simon J.
3116f58b-c30c-48cf-bdd5-397d1c1fecf8

Coles, Simon J. (2007) A repository based framework for capture, management, curation and dissemination of research data. The 2007 Microsoft eScience Workshop at RENCI, Chapel Hill, USA. 21 - 23 Oct 2007. (Submitted)

Record type: Conference or Workshop Item (Other)

Abstract

Based on the e-Bank-UK (http://www.ukoln.ac.uk/projects/ebank-uk) and Repository for the Laboratory, R4L (http://r4l.eprints.org) projects, a working model for a scientific data capture, management, curation and dissemination framework will be presented. The eCrystals repository has been constructed on an institutional repository platform and has been configured to ingest small molecule crystallographic data generated by the UK National Crystallography Service, whilst the R4L repository supports a range of different types of analytical chemistry data. This model addresses the current escalating ‘data deluge’ problem through integration of digital libraries technologies with both the research laboratory and also with established publication and dissemination routes. The institutional model provides a potential mechanism for the long term archival and availability of information in a manner that enables the capture of its research data output through integration into the laboratory environment. The repository ingest process ensures full capture of laboratory data and effective metadata creation at the point it is generated. A private archive provides effective management of the data, whilst an embargo procedure allows dissemination of results through a public archive in a timely manner. A schema for the dissemination of crystallographic data has been devised through consultation with the community which enables effective harvesting by data centres and third party aggregator services. The use of persistent identifiers provides a mechanism to permanently link the conventional scholarly article with its associated underlying dataset. Current work is investigating the issues associated with the construction of a federation of data repositories (institutional and subject based) operating on different software platforms and its long term integration into the publishing and chemical information provision processes.

Slideshow
SJC_MS_eScience_workshop.pptx - Other
Download (5MB)

More information

Submitted date: 2 November 2007
Venue - Dates: The 2007 Microsoft eScience Workshop at RENCI, Chapel Hill, USA, 2007-10-21 - 2007-10-23

Identifiers

Local EPrints ID: 49395
URI: http://eprints.soton.ac.uk/id/eprint/49395
PURE UUID: 210199ff-1192-440b-aa83-7c3a22d6162f
ORCID for Simon J. Coles: ORCID iD orcid.org/0000-0001-8414-9272

Catalogue record

Date deposited: 07 Nov 2007
Last modified: 16 Mar 2024 03:05

Export record

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×