eCrystals: A Route for Open Access to Small Molecule Crystal Structure Data.
Recently the funding councils in the UK stated that ‘the data underpinning the published results of publically-funded research should be made available as widely and rapidly as possible’.
Thirty years ago a research student would present about five crystal structures in a PhD thesis; with modern technologies and good crystals, the same number can now be determined in a single morning. This increase in the pace of generation exacerbates a problem in communicating the results. The usual route for publishing a crystal structure report is coupled with, and often governed by, the underlying chemistry; it is therefore subject to the lengthy peer review process and tied to the timing of the publication as a whole. This bottleneck in the dissemination of crystal structure data hinders the potential growth of databases and the data mining studies that rely on these collections. Just 500,000 small-unit-cell crystal structures are available in the CSD, ICSD and CRYSMET databases, while it is estimated that at least twice this number have been determined in research laboratories and are likely to remain unpublished. In addition, publication in the mainstream literature still offers only indirect (and often subscription-controlled) access to this data.
The eBank-UK project (http://www.ukoln.ac.uk/projects/ebank-uk/) has addressed this problem by establishing an institutional data repository that supports, manages and disseminates metadata relating to the crystal structure data it contains (i.e. all the files generated during a crystal structure determination). This process alters the traditional method of peer review by providing crystal structure data openly, so that the reader or user may check correctness and validity directly. The repository (http://ecrystals.chem.soton.ac.uk) makes available all the raw, derived and results data from a crystallographic experiment with little further researcher effort after the creation of a normal completed structure in a laboratory archive. Not only does this approach allow rapid release of crystal structure data into the public domain, but it can also provide mechanisms for value-added services that allow rapid discovery of the data for further studies and reuse, whilst ownership of the data is retained by the creator.
The details of data preparation, the upload process, the files supported and automatic report generation will be presented. Additionally, the process whereby metadata relating to each archive entry is disseminated, using current Digital Libraries technologies, for discovery and reuse will be summarised. Strategies for the installation of archives at new sites, the construction of harvesting and aggregator services and the interaction with crystallographic data holding bodies, such as IUCr and CCDC, will also be outlined.
Links to educational tools, specifically the Schools eMalaria project (http://emalaria.soton.ac.uk), will also be presented.
Keywords: electronic publishing, crystallographic databases, computer networking
Coles, Simon J.
Hursthouse, Michael B.
Frey, Jeremy G.
Milsted, Andrew J.
Carr, Leslie A.
Koch, Traugott
Lyon, Elizabeth
Duke, Monica
Coles, Simon J., Hursthouse, Michael B., Frey, Jeremy G., Milsted, Andrew J., Carr, Leslie A., Koch, Traugott, Lyon, Elizabeth and Duke, Monica (2006) eCrystals: A Route for Open Access to Small Molecule Crystal Structure Data. ECM 23, Leuven, Belgium. 06 - 10 Aug 2006. (Submitted)
Record type: Conference or Workshop Item (Other)
Slideshow
ECM23_eCrystals.ppt
- Other
More information
Submitted date: 10 August 2006
Venue - Dates:
ECM 23, Leuven, Belgium, 2006-08-06 - 2006-08-10
Identifiers
Local EPrints ID: 41257
URI: http://eprints.soton.ac.uk/id/eprint/41257
PURE UUID: 8b5a3f5d-58df-4b21-9adb-55c22f94d09d
Catalogue record
Date deposited: 11 Aug 2006
Last modified: 16 Mar 2024 03:05
Contributors
Author:
Andrew J. Milsted
Author:
Traugott Koch
Author:
Elizabeth Lyon
Author:
Monica Duke