Machine learning of quantum mechanical lattice energies for molecular crystal structure prediction
Machine learning of quantum mechanical lattice energies for molecular crystal structure prediction
It is essential to study the crystal phases, i.e. polymorphs, of an organic molecule, under different conditions for the discovery and safety of new functional materials, and the effectiveness and toxicity of pharmaceuticals. This can be avoided by understanding the differences and likely transitions between polymorphs. Crystal Structure Prediction (CSP) is the process of generating crystal structures of a molecule(s). One way of doing this is to apply symmetry operations to a unit cell containing a molecule(s) representing the periodic structure, and optimising the geometries to local energy minima on the potential energy surface (PES) for the lattice. A local optimisation algorithm can be more useful than finding the global minimum of the energy landscape since several low-lying energy polymorphs can be observed.
Due to the number of atoms involved in organic molecular crystals, classical force fields (FFs) are typically used for calculating the lattice energy, and only the most likely, low energy candidates, are re-optimised using a higher level of the hierarchy of electronic structure methods.
The final re-optimisation step using a suitable electronic structure method is crucial since the lattice energy of polymorphs can differ by sub kJ/mol (Nyman et al. (2016); Nyman and Day (2015)) so method errors can significantly change the ranking of the lowest energy polymorphs, particularly if free energies are calculated. This becomes even more problematic due to the thousands of polymorphs in the low energy window of the energy landscape, for example 20 kJ/mol or below.
The aim is to reduce the enormous computational cost of this using Machine Learning (ML). This work will begin by reviewing current CSP methods and ML techniques. A set of thiophenes will become the focus to demonstrate the usefulness of CSP in the discovery of organic semiconductor materials. ML techniques will be applied to the PES to investigate an inexpensive geometry optimisation, where poorly predicted structures may be limiting the accuracy of the energy ranking of the CSP landscape.
University of Southampton
Clements, Rebecca Jane
c92a2d47-ede3-4ce9-bd61-81b8c9681aa6
September 2024
Clements, Rebecca Jane
c92a2d47-ede3-4ce9-bd61-81b8c9681aa6
Day, Graeme
e3be79ba-ad12-4461-b735-74d5c4355636
Woods, Dave
ae21f7e2-29d9-4f55-98a2-639c5e44c79c
Clements, Rebecca Jane
(2024)
Machine learning of quantum mechanical lattice energies for molecular crystal structure prediction.
University of Southampton, Doctoral Thesis, 306pp.
Record type:
Thesis
(Doctoral)
Abstract
It is essential to study the crystal phases, i.e. polymorphs, of an organic molecule, under different conditions for the discovery and safety of new functional materials, and the effectiveness and toxicity of pharmaceuticals. This can be avoided by understanding the differences and likely transitions between polymorphs. Crystal Structure Prediction (CSP) is the process of generating crystal structures of a molecule(s). One way of doing this is to apply symmetry operations to a unit cell containing a molecule(s) representing the periodic structure, and optimising the geometries to local energy minima on the potential energy surface (PES) for the lattice. A local optimisation algorithm can be more useful than finding the global minimum of the energy landscape since several low-lying energy polymorphs can be observed.
Due to the number of atoms involved in organic molecular crystals, classical force fields (FFs) are typically used for calculating the lattice energy, and only the most likely, low energy candidates, are re-optimised using a higher level of the hierarchy of electronic structure methods.
The final re-optimisation step using a suitable electronic structure method is crucial since the lattice energy of polymorphs can differ by sub kJ/mol (Nyman et al. (2016); Nyman and Day (2015)) so method errors can significantly change the ranking of the lowest energy polymorphs, particularly if free energies are calculated. This becomes even more problematic due to the thousands of polymorphs in the low energy window of the energy landscape, for example 20 kJ/mol or below.
The aim is to reduce the enormous computational cost of this using Machine Learning (ML). This work will begin by reviewing current CSP methods and ML techniques. A set of thiophenes will become the focus to demonstrate the usefulness of CSP in the discovery of organic semiconductor materials. ML techniques will be applied to the PES to investigate an inexpensive geometry optimisation, where poorly predicted structures may be limiting the accuracy of the energy ranking of the CSP landscape.
Text
Clements_thesis_final
- Version of Record
Text
Final-thesis-submission-Examination-Miss-Rebecca-Clements (1)
Restricted to Repository staff only
More information
Published date: September 2024
Identifiers
Local EPrints ID: 494056
URI: http://eprints.soton.ac.uk/id/eprint/494056
PURE UUID: 5363b457-3635-41e9-b363-59e67f70ddf1
Catalogue record
Date deposited: 20 Sep 2024 16:50
Last modified: 10 Jan 2025 02:50
Export record
Contributors
Author:
Rebecca Jane Clements
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics