The University of Southampton
University of Southampton Institutional Repository

Dataset: CSP-generated crystal structures of 1,000+ rigid organic molecules

Dataset: CSP-generated crystal structures of 1,000+ rigid organic molecules
Dataset: CSP-generated crystal structures of 1,000+ rigid organic molecules
This dataset supports the publication: AUTHORS: Christopher R. Taylor, Patrick W. V. Butler, Graeme M. Day TITLE: Predictive crystallography at scale: mapping, validating, and learning from 1,000 crystal energy landscapes JOURNAL: Faraday Discussions A consolidated dataset of crystal structure predictions (CSPs) for 1007 unique rigid, organic molecules with observed crystal structures in the Cambridge Structural Database (CSD). Each CSP is described by a "landscape" of hypothetical crystal structures, ranked in terms of their lattice energy; this dataset includes both the crystal structures themselves and their energy rankings. This dataset also includes two machine-learning-derived models to improve the energy ranking of crystal structures on their respective landscapes; one a committee neural-network potential (NNP) to correct energies of fixed structures, the other a message-passing neural-network (MACE) model used to re-optimise particularly difficult crystal structures.
crystal structure prediction, materials discovery, crystallography, materials screening, solid-state chemistry, computational chemistry, polymorphism, Machine Learning
University of Southampton
Taylor, Christopher
95bebf3a-a98a-453c-acb6-aebc451bd5a8
Day, Graeme
e3be79ba-ad12-4461-b735-74d5c4355636
Butler, Patrick Walter Villers
6e0f7f4a-4cb5-4868-9820-d120c7d905f8
Taylor, Christopher
95bebf3a-a98a-453c-acb6-aebc451bd5a8
Day, Graeme
e3be79ba-ad12-4461-b735-74d5c4355636
Butler, Patrick Walter Villers
6e0f7f4a-4cb5-4868-9820-d120c7d905f8

Taylor, Christopher, Day, Graeme and Butler, Patrick Walter Villers (2024) Dataset: CSP-generated crystal structures of 1,000+ rigid organic molecules. University of Southampton doi:10.5258/SOTON/D3094 [Dataset]

Record type: Dataset

Abstract

This dataset supports the publication: AUTHORS: Christopher R. Taylor, Patrick W. V. Butler, Graeme M. Day TITLE: Predictive crystallography at scale: mapping, validating, and learning from 1,000 crystal energy landscapes JOURNAL: Faraday Discussions A consolidated dataset of crystal structure predictions (CSPs) for 1007 unique rigid, organic molecules with observed crystal structures in the Cambridge Structural Database (CSD). Each CSP is described by a "landscape" of hypothetical crystal structures, ranked in terms of their lattice energy; this dataset includes both the crystal structures themselves and their energy rankings. This dataset also includes two machine-learning-derived models to improve the energy ranking of crystal structures on their respective landscapes; one a committee neural-network potential (NNP) to correct energies of fixed structures, the other a message-passing neural-network (MACE) model used to re-optimise particularly difficult crystal structures.

Text
D3094-README.txt - Text
Available under License Creative Commons Attribution.
Download (4kB)
Text
chiral.txt - Text
Available under License Creative Commons Attribution.
Download (2kB)
Archive
Delta-ML_Corr_Landscapes.zip - Dataset
Available under License Creative Commons Attribution.
Download (195MB)
Archive
MACE_Reopt_Landscapes.zip - Dataset
Available under License Creative Commons Attribution.
Download (1GB)
Archive
MACE_total_energy_model.zip - Dataset
Available under License Creative Commons Attribution.
Download (1MB)
Archive
NNP_correction_model.zip - Dataset
Available under License Creative Commons Attribution.
Download (7MB)
Text
non-chiral.txt - Text
Available under License Creative Commons Attribution.
Download (4kB)
Text
ranks_of_expt_matches.csv - Dataset
Available under License Creative Commons Attribution.
Download (75kB)
Text
sohncke.txt - Text
Available under License Creative Commons Attribution.
Download (826B)
Text
unique_names.txt - Text
Available under License Creative Commons Attribution.
Download (7kB)
Archive
FIT-DMA_Landscapes.zip - Dataset
Available under License Creative Commons Attribution.
Download (2GB)

Show all 11 downloads.

More information

Published date: 29 May 2024
Keywords: crystal structure prediction, materials discovery, crystallography, materials screening, solid-state chemistry, computational chemistry, polymorphism, Machine Learning

Identifiers

Local EPrints ID: 490717
URI: http://eprints.soton.ac.uk/id/eprint/490717
PURE UUID: a5c5b222-da8c-4590-b1c9-3504f3fc18e9
ORCID for Christopher Taylor: ORCID iD orcid.org/0000-0001-9465-5742
ORCID for Graeme Day: ORCID iD orcid.org/0000-0001-8396-2771

Catalogue record

Date deposited: 04 Jun 2024 16:42
Last modified: 17 Jul 2024 01:48

Export record

Altmetrics

Contributors

Creator: Christopher Taylor ORCID iD
Creator: Graeme Day ORCID iD
Creator: Patrick Walter Villers Butler

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×