AI3SD Video: Making sense of highly flexible molecular simulations: Where AI can help and where not
AI3SD Video: Making sense of highly flexible molecular simulations: Where AI can help and where not
With simulating the dynamic behaviour of ever bigger molecular systems for longer simulation time we simultaneously achieve more realistic timescales and gain much better insights into the physiological relevant time dependent behaviour of molecular systems, but also generate significantly more data and thus pose new challenges for filtering noise and analysing the simulation data. In simulation analysis and data dimensionality reduction we often rely on linear dependencies and behaviour within the simulated timespan. This generally is true for systems that show slow structural or conformational transitions over the simulated timespan. For example, proteins and enzyme simulations that undergo large scale conformational changes can adequately be analysed by methods of principle component analysis (PCA) and analysing and visualising low frequency normal modes. However, much more flexible molecular systems undergoing multiple and seemingly chaotic conformational changes are posing challenges for their analysis. Here, we need to advance our analysis toolbox. Moreover, it is important to understand the flexibility and linear behaviour of the simulated system before choosing the analysis methods. In this talk we present three very differently behaving molecular systems including an enzymatic activation process, a flexible self-assembling host guest system, and a highly flexible dataset of lipid molecules relevant for antibiotic resistance of mycobacterium tuberculosis. We will show how chaos theory can help to understand the flexibility patterns of the simulations, then present classical PCA based simulation analysis, before introducing the opportunities and challenges for unsupervised machine learning methodologies. The presented methods will concentrate on competitive learning methods (self-organizing maps) and density based clustering algorithms (DB scan) to analysis dominating and hidden structural features in seemingly chaotic simulation data.
Jäger, Christof
1a4211d3-fa12-4005-b142-cd435cdad0cd
Frey, Jeremy G.
ba60c559-c4af-44f1-87e6-ce69819bf23f
Kanza, Samantha
b73bcf34-3ff8-4691-bd09-aa657dcff420
Niranjan, Mahesan
5cbaeea8-7288-4b55-a89c-c43d212ddd4f
2 March 2022
Jäger, Christof
1a4211d3-fa12-4005-b142-cd435cdad0cd
Frey, Jeremy G.
ba60c559-c4af-44f1-87e6-ce69819bf23f
Kanza, Samantha
b73bcf34-3ff8-4691-bd09-aa657dcff420
Niranjan, Mahesan
5cbaeea8-7288-4b55-a89c-c43d212ddd4f
Jäger, Christof
(2022)
AI3SD Video: Making sense of highly flexible molecular simulations: Where AI can help and where not.
Frey, Jeremy G., Kanza, Samantha and Niranjan, Mahesan
(eds.)
AI4SD Network+ Conference, Chilworth Manor , Southampton, United Kingdom.
01 - 03 Mar 2022.
(doi:10.5258/SOTON/AI3SD0200).
Record type:
Conference or Workshop Item
(Other)
Abstract
With simulating the dynamic behaviour of ever bigger molecular systems for longer simulation time we simultaneously achieve more realistic timescales and gain much better insights into the physiological relevant time dependent behaviour of molecular systems, but also generate significantly more data and thus pose new challenges for filtering noise and analysing the simulation data. In simulation analysis and data dimensionality reduction we often rely on linear dependencies and behaviour within the simulated timespan. This generally is true for systems that show slow structural or conformational transitions over the simulated timespan. For example, proteins and enzyme simulations that undergo large scale conformational changes can adequately be analysed by methods of principle component analysis (PCA) and analysing and visualising low frequency normal modes. However, much more flexible molecular systems undergoing multiple and seemingly chaotic conformational changes are posing challenges for their analysis. Here, we need to advance our analysis toolbox. Moreover, it is important to understand the flexibility and linear behaviour of the simulated system before choosing the analysis methods. In this talk we present three very differently behaving molecular systems including an enzymatic activation process, a flexible self-assembling host guest system, and a highly flexible dataset of lipid molecules relevant for antibiotic resistance of mycobacterium tuberculosis. We will show how chaos theory can help to understand the flexibility patterns of the simulations, then present classical PCA based simulation analysis, before introducing the opportunities and challenges for unsupervised machine learning methodologies. The presented methods will concentrate on competitive learning methods (self-organizing maps) and density based clustering algorithms (DB scan) to analysis dominating and hidden structural features in seemingly chaotic simulation data.
Video
ai4sd_march_2022_day_2_ChristofJager
- Version of Record
More information
Published date: 2 March 2022
Additional Information:
Christof Jäger is an Assistant Professor at the University of Nottingham working on computational chemistry in the field of enzyme design, biotechnology, supramolecular chemistry, and catalysis. Following his PhD in supramolecular computational chemistry at the Friedrich-Alexander-Universität (FAU) Erlangen- Nürnberg in Germany in 2010 and postdoctoral research on the design of organic electronic devices he moved to Nottingham in 2014. Since 2015 he worked as a Marie Curie COFUND and Nottingham Advanced Research Fellow on computational strategies for predictive enzyme engineering, before being promoted to his current position in 2018.
Venue - Dates:
AI4SD Network+ Conference, Chilworth Manor , Southampton, United Kingdom, 2022-03-01 - 2022-03-03
Identifiers
Local EPrints ID: 468638
URI: http://eprints.soton.ac.uk/id/eprint/468638
PURE UUID: 40c7b9d8-1395-4d33-886c-440d8a94d890
Catalogue record
Date deposited: 19 Aug 2022 16:34
Last modified: 17 Mar 2024 03:51
Export record
Altmetrics
Contributors
Author:
Christof Jäger
Editor:
Mahesan Niranjan
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics