Evaluating and improving the robustness of alchemical binding free energy calculations using adaptive enhanced sampling methods
Evaluating and improving the robustness of alchemical binding free energy calculations using adaptive enhanced sampling methods
Alchemical protein–ligand binding free energy calculations are currently a topic in computational chemistry which requires expert knowledge and a multitude of initial choices and parameters set by the researcher. While the impact of many of these decisions on the resulting free energy values has been explored in recent years, the influence of the initial protein crystal structure, as well as the protonation, tautomeric and rotameric states of the amino acid side chains on the free energy values have been underexplored. To perform these studies, a Python library (ProtoCaller) supporting an arbitrary level of automation for setting up and running binding free energy calculations is first developed and presented. Afterwards, it is shown that the choice of initial protein crystal structure can significantly impact the resulting free energy values at short timescales, while ligand rare events can induce discrepancies at longer timescales. Similarly, different initial histidine protonation, tautomeric and rotameric states can also result in free energy discrepancies, showcasing the need for enhanced sampling methods on protein and ligand degrees of freedom. To address these sampling problems, an alchemical variant of the sequential Monte Carlo (SMC) enhanced sampling method is presented and validated on a range of test cases. This methodology is then augmented with long-timescale sampling provided by simulated tempering (ST), whose initial parameters are obtained from a preliminary exploratory SMC simulation and are afterwards refined over time in an adaptive fashion. The resulting method—fully adaptive simulated tempering (FAST)—is completely automatable and does not require any system-dependent parameters, making it generally applicable to the ligand sampling problem. Finally, FAST is applied to relative protein–ligand binding free energy calculations, enabling their full automation in combination with adaptive enhanced sampling. This methodology improves free energy reproducibility by decreasing the number of initial choices made by the researcher and can also be readily generalised to other sampling scenarios, making it a highly relevant contribution to the field.
University of Southampton
Suruzhon, Miroslav
4ea4dd8b-0a98-4598-9eaa-756d943b5dca
Suruzhon, Miroslav
4ea4dd8b-0a98-4598-9eaa-756d943b5dca
Essex, Jonathan
1f409cfe-6ba4-42e2-a0ab-a931826314b5
Suruzhon, Miroslav
(2022)
Evaluating and improving the robustness of alchemical binding free energy calculations using adaptive enhanced sampling methods.
University of Southampton, Doctoral Thesis, 235pp.
Record type:
Thesis
(Doctoral)
Abstract
Alchemical protein–ligand binding free energy calculations are currently a topic in computational chemistry which requires expert knowledge and a multitude of initial choices and parameters set by the researcher. While the impact of many of these decisions on the resulting free energy values has been explored in recent years, the influence of the initial protein crystal structure, as well as the protonation, tautomeric and rotameric states of the amino acid side chains on the free energy values have been underexplored. To perform these studies, a Python library (ProtoCaller) supporting an arbitrary level of automation for setting up and running binding free energy calculations is first developed and presented. Afterwards, it is shown that the choice of initial protein crystal structure can significantly impact the resulting free energy values at short timescales, while ligand rare events can induce discrepancies at longer timescales. Similarly, different initial histidine protonation, tautomeric and rotameric states can also result in free energy discrepancies, showcasing the need for enhanced sampling methods on protein and ligand degrees of freedom. To address these sampling problems, an alchemical variant of the sequential Monte Carlo (SMC) enhanced sampling method is presented and validated on a range of test cases. This methodology is then augmented with long-timescale sampling provided by simulated tempering (ST), whose initial parameters are obtained from a preliminary exploratory SMC simulation and are afterwards refined over time in an adaptive fashion. The resulting method—fully adaptive simulated tempering (FAST)—is completely automatable and does not require any system-dependent parameters, making it generally applicable to the ligand sampling problem. Finally, FAST is applied to relative protein–ligand binding free energy calculations, enabling their full automation in combination with adaptive enhanced sampling. This methodology improves free energy reproducibility by decreasing the number of initial choices made by the researcher and can also be readily generalised to other sampling scenarios, making it a highly relevant contribution to the field.
Text
Suruzhon, M. PhD thesis
- Version of Record
Text
Suruzhon, M. Permission to deposit thesis - form
- Version of Record
Restricted to Repository staff only
More information
Submitted date: May 2022
Identifiers
Local EPrints ID: 467587
URI: http://eprints.soton.ac.uk/id/eprint/467587
PURE UUID: 52f3684e-1d1e-44b9-acf0-fd0ebe0b0e07
Catalogue record
Date deposited: 14 Jul 2022 17:11
Last modified: 17 Mar 2024 07:23
Export record
Contributors
Author:
Miroslav Suruzhon
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics