The University of Southampton
University of Southampton Institutional Repository

eCrystals: A Route for Open Access to Small Molecule Crystal Structure Data.

eCrystals: A Route for Open Access to Small Molecule Crystal Structure Data.
eCrystals: A Route for Open Access to Small Molecule Crystal Structure Data.
Recently the funding councils in the UK stated that ‘the data underpinning the published results of publically-funded research should be made available as widely and rapidly as possible’. Thirty years ago a research student would present about five crystal structures as their PhD thesis, however with modern technologies and good crystals this can now be achieved in the timespan of a single morning. This increase in pace of generation further exacerbates a problem in the communication of the results. Additionally, the general route for the publication of a crystal structure report is coupled with and often governed by the underlying chemistry and is therefore subject to the lengthy peer review process and tied to the timing of the publication as a whole. This bottleneck in the dissemination of crystal structure data hinders the potential growth of databases and the data mining studies that are reliant on these collections. Just 500,000 small unit cell crystal structures are available in the CSD, ICSD & CRYSMET databases, while it is estimated that at least twice this number have been determined in research laboratories and are likely to remain unpublished. In addition, publication in the mainstream literature still offers only indirect (and often subscription controlled) access to this data. The work of the eBank-UK project (http://www.ukoln.ac.uk/projects/ebank-uk/) has addressed this problem by establishing an institutional data repository that supports, manages and disseminates metadata relating to the crystal structure data it contains (i.e. all the files generated during a crystal structure determination). This process alters the traditional method of peer review by openly providing crystal structure data where the reader or user may directly check correctness and validity. The repository (http://ecrystals.chem.soton.ac.uk) makes available all the raw, derived and results data from a crystallographic experiment with little further researcher effort after the creation of a normal completed structure in a laboratory archive. Not only does this approach allow rapid release of crystal structure data into the public domain, but it can also provide mechanisms for value added services that allow rapid discovery of the data for further studies and reuse, whilst ownership of the data is retained by the creator. The details of the preparation of data, upload process, files supported and automatic report generation will be presented. Additionally, the process whereby metadata relating to each archive entry is disseminated, using current Digital Libraries technologies, for discovery and reuse by will be summarised. Strategies for the installation of archives at new sites, the construction of harvesting and aggregator services and the interaction with crystallographic data holding bodies, such as IUCr and CCDC, will also be outlined. Additionally links to educational tools, specifically the Schools eMalaria project (http://emalaria.soton.ac.uk), will also be presented.
electronic publishing, crystallographic databases, computer networking
Coles, Simon J.
3116f58b-c30c-48cf-bdd5-397d1c1fecf8
Hursthouse, Michael B.
57a2ddf9-b1b3-4f38-bfe9-ef2f526388da
Frey, Jeremy G.
ba60c559-c4af-44f1-87e6-ce69819bf23f
Milsted, Andrew J.
c28cf092-dfbe-488a-a624-4427f68bada2
Carr, Leslie A.
0572b10e-039d-46c6-bf05-57cce71d3936
Koch, Traugott
ebd4727a-7179-440e-a4c7-8cd350dcc399
Lyon, Elizabeth
67ffa18f-39b1-4fac-b420-9b12f7c499d8
Duke, Monica
45ad4501-5a90-41dc-9b2d-524e97291784
Coles, Simon J.
3116f58b-c30c-48cf-bdd5-397d1c1fecf8
Hursthouse, Michael B.
57a2ddf9-b1b3-4f38-bfe9-ef2f526388da
Frey, Jeremy G.
ba60c559-c4af-44f1-87e6-ce69819bf23f
Milsted, Andrew J.
c28cf092-dfbe-488a-a624-4427f68bada2
Carr, Leslie A.
0572b10e-039d-46c6-bf05-57cce71d3936
Koch, Traugott
ebd4727a-7179-440e-a4c7-8cd350dcc399
Lyon, Elizabeth
67ffa18f-39b1-4fac-b420-9b12f7c499d8
Duke, Monica
45ad4501-5a90-41dc-9b2d-524e97291784

Coles, Simon J., Hursthouse, Michael B., Frey, Jeremy G., Milsted, Andrew J., Carr, Leslie A., Koch, Traugott, Lyon, Elizabeth and Duke, Monica (2006) eCrystals: A Route for Open Access to Small Molecule Crystal Structure Data. ECM 23, Leuven, Belgium. 06 - 10 Aug 2006. (Submitted)

Record type: Conference or Workshop Item (Other)

Abstract

Recently the funding councils in the UK stated that ‘the data underpinning the published results of publically-funded research should be made available as widely and rapidly as possible’. Thirty years ago a research student would present about five crystal structures as their PhD thesis, however with modern technologies and good crystals this can now be achieved in the timespan of a single morning. This increase in pace of generation further exacerbates a problem in the communication of the results. Additionally, the general route for the publication of a crystal structure report is coupled with and often governed by the underlying chemistry and is therefore subject to the lengthy peer review process and tied to the timing of the publication as a whole. This bottleneck in the dissemination of crystal structure data hinders the potential growth of databases and the data mining studies that are reliant on these collections. Just 500,000 small unit cell crystal structures are available in the CSD, ICSD & CRYSMET databases, while it is estimated that at least twice this number have been determined in research laboratories and are likely to remain unpublished. In addition, publication in the mainstream literature still offers only indirect (and often subscription controlled) access to this data. The work of the eBank-UK project (http://www.ukoln.ac.uk/projects/ebank-uk/) has addressed this problem by establishing an institutional data repository that supports, manages and disseminates metadata relating to the crystal structure data it contains (i.e. all the files generated during a crystal structure determination). This process alters the traditional method of peer review by openly providing crystal structure data where the reader or user may directly check correctness and validity. The repository (http://ecrystals.chem.soton.ac.uk) makes available all the raw, derived and results data from a crystallographic experiment with little further researcher effort after the creation of a normal completed structure in a laboratory archive. Not only does this approach allow rapid release of crystal structure data into the public domain, but it can also provide mechanisms for value added services that allow rapid discovery of the data for further studies and reuse, whilst ownership of the data is retained by the creator. The details of the preparation of data, upload process, files supported and automatic report generation will be presented. Additionally, the process whereby metadata relating to each archive entry is disseminated, using current Digital Libraries technologies, for discovery and reuse by will be summarised. Strategies for the installation of archives at new sites, the construction of harvesting and aggregator services and the interaction with crystallographic data holding bodies, such as IUCr and CCDC, will also be outlined. Additionally links to educational tools, specifically the Schools eMalaria project (http://emalaria.soton.ac.uk), will also be presented.

Slideshow
ECM23_eCrystals.ppt - Other
Download (6MB)

More information

Submitted date: 10 August 2006
Venue - Dates: ECM 23, Leuven, Belgium, 2006-08-06 - 2006-08-10
Keywords: electronic publishing, crystallographic databases, computer networking

Identifiers

Local EPrints ID: 41257
URI: http://eprints.soton.ac.uk/id/eprint/41257
PURE UUID: 8b5a3f5d-58df-4b21-9adb-55c22f94d09d
ORCID for Simon J. Coles: ORCID iD orcid.org/0000-0001-8414-9272
ORCID for Jeremy G. Frey: ORCID iD orcid.org/0000-0003-0842-4302
ORCID for Leslie A. Carr: ORCID iD orcid.org/0000-0002-2113-9680

Catalogue record

Date deposited: 11 Aug 2006
Last modified: 16 Mar 2024 03:05

Export record

Contributors

Author: Simon J. Coles ORCID iD
Author: Jeremy G. Frey ORCID iD
Author: Andrew J. Milsted
Author: Leslie A. Carr ORCID iD
Author: Traugott Koch
Author: Elizabeth Lyon
Author: Monica Duke

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×