The University of Southampton
University of Southampton Institutional Repository

Predictive crystallography at scale: mapping, validating, and learning from 1,000 crystal energy landscapes

Predictive crystallography at scale: mapping, validating, and learning from 1,000 crystal energy landscapes
Predictive crystallography at scale: mapping, validating, and learning from 1,000 crystal energy landscapes
Computational crystal structure prediction (CSP) is an increasingly powerful technique in materials discovery, due to its ability to reveal trends and permit insight across the possibility space of crystal structures of a candidate molecule, beyond simply the observed structure(s). In this work, we demonstrate the reliability and scalability of CSP methods for small, rigid organic molecules by performing in-depth CSP investigations for over 1000 such compounds, the largest survey of its kind to-date. We show that this highly-efficient force-field-based CSP approach is superbly predictive, locating 99.4\% of observed experimental structures, and ranking a large majority of these (74\%) as among the most stable possible structures (to within uncertainty due to thermal effects). We present two examples of insights such large predicted datasets can permit, examining the space group preferences of organic molecular crystals and rationalising empirical rules concerning the spontaneous resolution of chiral molecules. Finally, we exploit this large and diverse dataset for developing transferable machine-learned energy potentials for the organic solid state, training a neural network lattice energy correction to force field energies that offers substantial improvements to the already impressive energy rankings, and a MACE equivariant message-passing neural network for crystal structure reoptimisation. We conclude that the excellent performance and reliability of the CSP workflow enables the creation of very large datasets of broad utility and explanatory power in materials design.
0301-7249
Taylor, Christopher R.
95bebf3a-a98a-453c-acb6-aebc451bd5a8
Butler, Patrick W.V.
6e0f7f4a-4cb5-4868-9820-d120c7d905f8
Day, Graeme M.
e3be79ba-ad12-4461-b735-74d5c4355636
Taylor, Christopher R.
95bebf3a-a98a-453c-acb6-aebc451bd5a8
Butler, Patrick W.V.
6e0f7f4a-4cb5-4868-9820-d120c7d905f8
Day, Graeme M.
e3be79ba-ad12-4461-b735-74d5c4355636

Taylor, Christopher R., Butler, Patrick W.V. and Day, Graeme M. (2024) Predictive crystallography at scale: mapping, validating, and learning from 1,000 crystal energy landscapes. Faraday Discussions. (doi:10.1039/D4FD00105B).

Record type: Article

Abstract

Computational crystal structure prediction (CSP) is an increasingly powerful technique in materials discovery, due to its ability to reveal trends and permit insight across the possibility space of crystal structures of a candidate molecule, beyond simply the observed structure(s). In this work, we demonstrate the reliability and scalability of CSP methods for small, rigid organic molecules by performing in-depth CSP investigations for over 1000 such compounds, the largest survey of its kind to-date. We show that this highly-efficient force-field-based CSP approach is superbly predictive, locating 99.4\% of observed experimental structures, and ranking a large majority of these (74\%) as among the most stable possible structures (to within uncertainty due to thermal effects). We present two examples of insights such large predicted datasets can permit, examining the space group preferences of organic molecular crystals and rationalising empirical rules concerning the spontaneous resolution of chiral molecules. Finally, we exploit this large and diverse dataset for developing transferable machine-learned energy potentials for the organic solid state, training a neural network lattice energy correction to force field energies that offers substantial improvements to the already impressive energy rankings, and a MACE equivariant message-passing neural network for crystal structure reoptimisation. We conclude that the excellent performance and reliability of the CSP workflow enables the creation of very large datasets of broad utility and explanatory power in materials design.

Text
d4fd00105b - Accepted Manuscript
Available under License Creative Commons Attribution.
Download (3MB)
Text
Faraday_Discussion_Predict_Crystallogr - Proof
Restricted to Repository staff only
Request a copy

More information

Accepted/In Press date: 22 May 2024
e-pub ahead of print date: 3 June 2024

Identifiers

Local EPrints ID: 490948
URI: http://eprints.soton.ac.uk/id/eprint/490948
ISSN: 0301-7249
PURE UUID: 3b5046b8-0c74-452d-94b7-573c1f10f06a
ORCID for Christopher R. Taylor: ORCID iD orcid.org/0000-0001-9465-5742
ORCID for Graeme M. Day: ORCID iD orcid.org/0000-0001-8396-2771

Catalogue record

Date deposited: 10 Jun 2024 16:38
Last modified: 03 Sep 2024 01:49

Export record

Altmetrics

Contributors

Author: Christopher R. Taylor ORCID iD
Author: Patrick W.V. Butler
Author: Graeme M. Day ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×