Fragment-based energy models and machine learning methods for the computational study of organic molecular crystals
Fragment-based energy models and machine learning methods for the computational study of organic molecular crystals
The focus of this work is to investigate approaches to improving crystal structure prediction (CSP) through the study of different energy models and the use of machine learning.Simple fragment-based models are introduced based on anisotropic force fields, and areobserved to consistently improve the ranking of known polymorphs in CSP landscapes.These models are found to be amenable to machine learning, especially when fragmentenergies are learnt, as opposed to total lattice energies. When applied to absolute energies, fragment-based models including two-body terms using Density Functional Theory(DFT) are found to be largely basis set agnostic. By contrast, models involving secondorder Møller-Plesset perturbation theory (MP2) depend heavily on the basis set used,with three-body terms becoming vital for some systems. Overall, these results highlightthe benefit of simple fragment-based models for CSP, providing a bridge between forcefield approaches and periodic DFT. Through the use of machine learning these models can remain competitive with the computational cost of force fields, allowing for theexploration of CSP landscapes at higher levels of theory.In addition to these models, Gaussian processes are used to learn cocrystal stabilisation energies, where CSP is used to estimate the stabilisation energies of hypotheticalcocrystals. Here CSP is found to be a viable tool for predicting stabilisation energies,though results from machine learning models remain modest.
University of Southampton
McDonagh, David
1ff4dd30-614e-484e-91b9-eef7002377a9
July 2020
McDonagh, David
1ff4dd30-614e-484e-91b9-eef7002377a9
Day, Graeme
e3be79ba-ad12-4461-b735-74d5c4355636
McDonagh, David
(2020)
Fragment-based energy models and machine learning methods for the computational study of organic molecular crystals.
Doctoral Thesis, 295pp.
Record type:
Thesis
(Doctoral)
Abstract
The focus of this work is to investigate approaches to improving crystal structure prediction (CSP) through the study of different energy models and the use of machine learning.Simple fragment-based models are introduced based on anisotropic force fields, and areobserved to consistently improve the ranking of known polymorphs in CSP landscapes.These models are found to be amenable to machine learning, especially when fragmentenergies are learnt, as opposed to total lattice energies. When applied to absolute energies, fragment-based models including two-body terms using Density Functional Theory(DFT) are found to be largely basis set agnostic. By contrast, models involving secondorder Møller-Plesset perturbation theory (MP2) depend heavily on the basis set used,with three-body terms becoming vital for some systems. Overall, these results highlightthe benefit of simple fragment-based models for CSP, providing a bridge between forcefield approaches and periodic DFT. Through the use of machine learning these models can remain competitive with the computational cost of force fields, allowing for theexploration of CSP landscapes at higher levels of theory.In addition to these models, Gaussian processes are used to learn cocrystal stabilisation energies, where CSP is used to estimate the stabilisation energies of hypotheticalcocrystals. Here CSP is found to be a viable tool for predicting stabilisation energies,though results from machine learning models remain modest.
Text
David_McDonagh_PhD_Thesis_Graeme_Day_Group_16_07_20
Restricted to Repository staff only
More information
Published date: July 2020
Identifiers
Local EPrints ID: 447598
URI: http://eprints.soton.ac.uk/id/eprint/447598
PURE UUID: 9eea28ca-9ffc-4cdd-88ac-cd05f0629136
Catalogue record
Date deposited: 16 Mar 2021 17:45
Last modified: 17 Mar 2024 06:10
Export record
Contributors
Author:
David McDonagh
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics