The University of Southampton
University of Southampton Institutional Repository

AI3SD Video: Making sense of highly flexible molecular simulations: Where AI can help and where not

AI3SD Video: Making sense of highly flexible molecular simulations: Where AI can help and where not
AI3SD Video: Making sense of highly flexible molecular simulations: Where AI can help and where not
With simulating the dynamic behaviour of ever bigger molecular systems for longer simulation time we simultaneously achieve more realistic timescales and gain much better insights into the physiological relevant time dependent behaviour of molecular systems, but also generate significantly more data and thus pose new challenges for filtering noise and analysing the simulation data. In simulation analysis and data dimensionality reduction we often rely on linear dependencies and behaviour within the simulated timespan. This generally is true for systems that show slow structural or conformational transitions over the simulated timespan. For example, proteins and enzyme simulations that undergo large scale conformational changes can adequately be analysed by methods of principle component analysis (PCA) and analysing and visualising low frequency normal modes. However, much more flexible molecular systems undergoing multiple and seemingly chaotic conformational changes are posing challenges for their analysis. Here, we need to advance our analysis toolbox. Moreover, it is important to understand the flexibility and linear behaviour of the simulated system before choosing the analysis methods. In this talk we present three very differently behaving molecular systems including an enzymatic activation process, a flexible self-assembling host guest system, and a highly flexible dataset of lipid molecules relevant for antibiotic resistance of mycobacterium tuberculosis. We will show how chaos theory can help to understand the flexibility patterns of the simulations, then present classical PCA based simulation analysis, before introducing the opportunities and challenges for unsupervised machine learning methodologies. The presented methods will concentrate on competitive learning methods (self-organizing maps) and density based clustering algorithms (DB scan) to analysis dominating and hidden structural features in seemingly chaotic simulation data.
Jäger, Christof
1a4211d3-fa12-4005-b142-cd435cdad0cd
Frey, Jeremy G.
ba60c559-c4af-44f1-87e6-ce69819bf23f
Kanza, Samantha
b73bcf34-3ff8-4691-bd09-aa657dcff420
Niranjan, Mahesan
5cbaeea8-7288-4b55-a89c-c43d212ddd4f
Jäger, Christof
1a4211d3-fa12-4005-b142-cd435cdad0cd
Frey, Jeremy G.
ba60c559-c4af-44f1-87e6-ce69819bf23f
Kanza, Samantha
b73bcf34-3ff8-4691-bd09-aa657dcff420
Niranjan, Mahesan
5cbaeea8-7288-4b55-a89c-c43d212ddd4f

Jäger, Christof (2022) AI3SD Video: Making sense of highly flexible molecular simulations: Where AI can help and where not. Frey, Jeremy G., Kanza, Samantha and Niranjan, Mahesan (eds.) AI4SD Network+ Conference, Chilworth Manor , Southampton, United Kingdom. 01 - 03 Mar 2022. (doi:10.5258/SOTON/AI3SD0200).

Record type: Conference or Workshop Item (Other)

Abstract

With simulating the dynamic behaviour of ever bigger molecular systems for longer simulation time we simultaneously achieve more realistic timescales and gain much better insights into the physiological relevant time dependent behaviour of molecular systems, but also generate significantly more data and thus pose new challenges for filtering noise and analysing the simulation data. In simulation analysis and data dimensionality reduction we often rely on linear dependencies and behaviour within the simulated timespan. This generally is true for systems that show slow structural or conformational transitions over the simulated timespan. For example, proteins and enzyme simulations that undergo large scale conformational changes can adequately be analysed by methods of principle component analysis (PCA) and analysing and visualising low frequency normal modes. However, much more flexible molecular systems undergoing multiple and seemingly chaotic conformational changes are posing challenges for their analysis. Here, we need to advance our analysis toolbox. Moreover, it is important to understand the flexibility and linear behaviour of the simulated system before choosing the analysis methods. In this talk we present three very differently behaving molecular systems including an enzymatic activation process, a flexible self-assembling host guest system, and a highly flexible dataset of lipid molecules relevant for antibiotic resistance of mycobacterium tuberculosis. We will show how chaos theory can help to understand the flexibility patterns of the simulations, then present classical PCA based simulation analysis, before introducing the opportunities and challenges for unsupervised machine learning methodologies. The presented methods will concentrate on competitive learning methods (self-organizing maps) and density based clustering algorithms (DB scan) to analysis dominating and hidden structural features in seemingly chaotic simulation data.

Video
ai4sd_march_2022_day_2_ChristofJager - Version of Record
Available under License Creative Commons Attribution.
Download (355MB)

More information

Published date: 2 March 2022
Additional Information: Christof Jäger is an Assistant Professor at the University of Nottingham working on computational chemistry in the field of enzyme design, biotechnology, supramolecular chemistry, and catalysis. Following his PhD in supramolecular computational chemistry at the Friedrich-Alexander-Universität (FAU) Erlangen- Nürnberg in Germany in 2010 and postdoctoral research on the design of organic electronic devices he moved to Nottingham in 2014. Since 2015 he worked as a Marie Curie COFUND and Nottingham Advanced Research Fellow on computational strategies for predictive enzyme engineering, before being promoted to his current position in 2018.
Venue - Dates: AI4SD Network+ Conference, Chilworth Manor , Southampton, United Kingdom, 2022-03-01 - 2022-03-03

Identifiers

Local EPrints ID: 468638
URI: http://eprints.soton.ac.uk/id/eprint/468638
PURE UUID: 40c7b9d8-1395-4d33-886c-440d8a94d890
ORCID for Jeremy G. Frey: ORCID iD orcid.org/0000-0003-0842-4302
ORCID for Samantha Kanza: ORCID iD orcid.org/0000-0002-4831-9489
ORCID for Mahesan Niranjan: ORCID iD orcid.org/0000-0001-7021-140X

Catalogue record

Date deposited: 19 Aug 2022 16:34
Last modified: 17 Mar 2024 03:51

Export record

Altmetrics

Contributors

Author: Christof Jäger
Editor: Jeremy G. Frey ORCID iD
Editor: Samantha Kanza ORCID iD
Editor: Mahesan Niranjan ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×