The University of Southampton
University of Southampton Institutional Repository

Transforming data to understanding and knowledge for the sequencing of therapeutic oligonucleotides by mass spectrometry

Transforming data to understanding and knowledge for the sequencing of therapeutic oligonucleotides by mass spectrometry
Transforming data to understanding and knowledge for the sequencing of therapeutic oligonucleotides by mass spectrometry
Recently, the numbers of oligonucleotides approved to the market for medicines has increased1 and take an important place in new drugs for undruggable diseases. For the safety of the patients, the drug, as well as any impurities from the synthesis, must be characterised.
Oligonucleotides can be analysed by negative ion electrospray ionisation (ESI) mass spectrometry with collision-induced dissociation (CID) where different product ions characteristic of the sequence are obtained. To understand and interpret the data, an expert analyst is needed to reconstruct the full sequence. Depending on the complexity and the length of the oligonucleotide and impurities present, it can take days, weeks, or months to sequence them manually. This can directly impact the time for the drug to be approved to the market. Some software packages exist to help the interpretation of the data but are limited due to a lack of detailed understanding how oligonucleotides dissociate in the gas phase inside the mass spectrometer.
To predict the dissociation of oligonucleotides, it is important to understand how simple sequences dissociate. To probe that, a library of unmodified and modified 21 mer oligonucleotides (phosphorothioate backbone rather than phosphodiester backbone) were synthesised to understand the impact of the different nucleobases present in the sequences, their localisation, and the impact of the different phosphate backbones on the dissociation. This library was analysed by direct infusion and RP-IP-LC-MS to obtain different charge states that were dissociated (with or without isolation) by isCID and CID. After dissociation the different charge states give different information and sequence coverages due to their conformation in the gas phase. During the assignment of the product ions, IMS has been used to help identify the charge states of the diverse product ions. After assignment of the product ions, the data were compared for the phosphodiester sequences, the phosphorothioate sequences, and then between the phosphodiester and phosphorothioate sequences.
The results from the library by high resolution negative ion ESI and CID mass spectrometry in the gas phase has shown that the dissociation is driven by the proton affinity of the nucleobases, the secondary structure by Watson-Crick pairing, the number of each nucleobase present in the sequence, their position, and their neighbours, depending on the charge state dissociated. By analysing, understanding, and comparing the same sequences with and without phosphorothioate phosphate backbone, new rules and observations have been obtained which are affected by modifying the phosphate backbone where the proton affinity of the nucleobases is impacted.
The use of this library will help to improve the current software to be more accurate and to predict the dissociation of more complex sequences depending to the charge state dissociated. Furthermore, the observations and rules obtained here could be used to develop a database to be incorporated into machine learning to predict the dissociation of known oligonucleotide structures and then characterise unknown oligonucleotides automatically. It would help the pharmaceutical industry to develop and characterise therapeutic oligonucleotides quicker, as well as their impurities, used for undruggable diseases. This will affect the lives of millions of patients worldwide to have access to their medicine faster.
Mass Spectrometry, oligonucleotide, Tandem Mass Spectrometry
University of Southampton
Hannauer, Fabien
b231daf2-4c82-49b0-87b6-bae7eb60499e
Hannauer, Fabien
b231daf2-4c82-49b0-87b6-bae7eb60499e
Langley, John
7ac80d61-b91d-4261-ad17-255f94ea21ea
Stulz, Eugen
9a6c04cf-32ca-442b-9281-bbf3d23c622d

Hannauer, Fabien (2024) Transforming data to understanding and knowledge for the sequencing of therapeutic oligonucleotides by mass spectrometry. University of Southampton, Doctoral Thesis, 326pp.

Record type: Thesis (Doctoral)

Abstract

Recently, the numbers of oligonucleotides approved to the market for medicines has increased1 and take an important place in new drugs for undruggable diseases. For the safety of the patients, the drug, as well as any impurities from the synthesis, must be characterised.
Oligonucleotides can be analysed by negative ion electrospray ionisation (ESI) mass spectrometry with collision-induced dissociation (CID) where different product ions characteristic of the sequence are obtained. To understand and interpret the data, an expert analyst is needed to reconstruct the full sequence. Depending on the complexity and the length of the oligonucleotide and impurities present, it can take days, weeks, or months to sequence them manually. This can directly impact the time for the drug to be approved to the market. Some software packages exist to help the interpretation of the data but are limited due to a lack of detailed understanding how oligonucleotides dissociate in the gas phase inside the mass spectrometer.
To predict the dissociation of oligonucleotides, it is important to understand how simple sequences dissociate. To probe that, a library of unmodified and modified 21 mer oligonucleotides (phosphorothioate backbone rather than phosphodiester backbone) were synthesised to understand the impact of the different nucleobases present in the sequences, their localisation, and the impact of the different phosphate backbones on the dissociation. This library was analysed by direct infusion and RP-IP-LC-MS to obtain different charge states that were dissociated (with or without isolation) by isCID and CID. After dissociation the different charge states give different information and sequence coverages due to their conformation in the gas phase. During the assignment of the product ions, IMS has been used to help identify the charge states of the diverse product ions. After assignment of the product ions, the data were compared for the phosphodiester sequences, the phosphorothioate sequences, and then between the phosphodiester and phosphorothioate sequences.
The results from the library by high resolution negative ion ESI and CID mass spectrometry in the gas phase has shown that the dissociation is driven by the proton affinity of the nucleobases, the secondary structure by Watson-Crick pairing, the number of each nucleobase present in the sequence, their position, and their neighbours, depending on the charge state dissociated. By analysing, understanding, and comparing the same sequences with and without phosphorothioate phosphate backbone, new rules and observations have been obtained which are affected by modifying the phosphate backbone where the proton affinity of the nucleobases is impacted.
The use of this library will help to improve the current software to be more accurate and to predict the dissociation of more complex sequences depending to the charge state dissociated. Furthermore, the observations and rules obtained here could be used to develop a database to be incorporated into machine learning to predict the dissociation of known oligonucleotide structures and then characterise unknown oligonucleotides automatically. It would help the pharmaceutical industry to develop and characterise therapeutic oligonucleotides quicker, as well as their impurities, used for undruggable diseases. This will affect the lives of millions of patients worldwide to have access to their medicine faster.

Text
Transforming Data to Understanding and Knowledge for the Sequencing of Therapeutic Oligonucleotides by Mass Spectrometry - Version of Record
Restricted to Repository staff only until 20 August 2026.
Available under License University of Southampton Thesis Licence.
Text
Final-thesis-submission-Examination-Mr-Fabien-Hannauer
Restricted to Repository staff only

More information

Published date: 2024
Keywords: Mass Spectrometry, oligonucleotide, Tandem Mass Spectrometry

Identifiers

Local EPrints ID: 494213
URI: http://eprints.soton.ac.uk/id/eprint/494213
PURE UUID: 287cae27-6120-43ae-b65e-6d2fe996d7cc
ORCID for Fabien Hannauer: ORCID iD orcid.org/0000-0002-8277-9817
ORCID for John Langley: ORCID iD orcid.org/0000-0002-8323-7235
ORCID for Eugen Stulz: ORCID iD orcid.org/0000-0002-5302-2276

Catalogue record

Date deposited: 30 Sep 2024 17:12
Last modified: 10 Jan 2025 03:08

Export record

Contributors

Author: Fabien Hannauer ORCID iD
Thesis advisor: John Langley ORCID iD
Thesis advisor: Eugen Stulz ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×