Transforming data to understanding and knowledge for the sequencing of therapeutic oligonucleotides by mass spectrometry
Transforming data to understanding and knowledge for the sequencing of therapeutic oligonucleotides by mass spectrometry
Recently, the numbers of oligonucleotides approved to the market for medicines has increased1 and take an important place in new drugs for undruggable diseases. For the safety of the patients, the drug, as well as any impurities from the synthesis, must be characterised.
Oligonucleotides can be analysed by negative ion electrospray ionisation (ESI) mass spectrometry with collision-induced dissociation (CID) where different product ions characteristic of the sequence are obtained. To understand and interpret the data, an expert analyst is needed to reconstruct the full sequence. Depending on the complexity and the length of the oligonucleotide and impurities present, it can take days, weeks, or months to sequence them manually. This can directly impact the time for the drug to be approved to the market. Some software packages exist to help the interpretation of the data but are limited due to a lack of detailed understanding how oligonucleotides dissociate in the gas phase inside the mass spectrometer.
To predict the dissociation of oligonucleotides, it is important to understand how simple sequences dissociate. To probe that, a library of unmodified and modified 21 mer oligonucleotides (phosphorothioate backbone rather than phosphodiester backbone) were synthesised to understand the impact of the different nucleobases present in the sequences, their localisation, and the impact of the different phosphate backbones on the dissociation. This library was analysed by direct infusion and RP-IP-LC-MS to obtain different charge states that were dissociated (with or without isolation) by isCID and CID. After dissociation the different charge states give different information and sequence coverages due to their conformation in the gas phase. During the assignment of the product ions, IMS has been used to help identify the charge states of the diverse product ions. After assignment of the product ions, the data were compared for the phosphodiester sequences, the phosphorothioate sequences, and then between the phosphodiester and phosphorothioate sequences.
The results from the library by high resolution negative ion ESI and CID mass spectrometry in the gas phase has shown that the dissociation is driven by the proton affinity of the nucleobases, the secondary structure by Watson-Crick pairing, the number of each nucleobase present in the sequence, their position, and their neighbours, depending on the charge state dissociated. By analysing, understanding, and comparing the same sequences with and without phosphorothioate phosphate backbone, new rules and observations have been obtained which are affected by modifying the phosphate backbone where the proton affinity of the nucleobases is impacted.
The use of this library will help to improve the current software to be more accurate and to predict the dissociation of more complex sequences depending to the charge state dissociated. Furthermore, the observations and rules obtained here could be used to develop a database to be incorporated into machine learning to predict the dissociation of known oligonucleotide structures and then characterise unknown oligonucleotides automatically. It would help the pharmaceutical industry to develop and characterise therapeutic oligonucleotides quicker, as well as their impurities, used for undruggable diseases. This will affect the lives of millions of patients worldwide to have access to their medicine faster.
Mass Spectrometry, oligonucleotide, Tandem Mass Spectrometry
University of Southampton
Hannauer, Fabien
b231daf2-4c82-49b0-87b6-bae7eb60499e
2024
Hannauer, Fabien
b231daf2-4c82-49b0-87b6-bae7eb60499e
Langley, John
7ac80d61-b91d-4261-ad17-255f94ea21ea
Stulz, Eugen
9a6c04cf-32ca-442b-9281-bbf3d23c622d
Hannauer, Fabien
(2024)
Transforming data to understanding and knowledge for the sequencing of therapeutic oligonucleotides by mass spectrometry.
University of Southampton, Doctoral Thesis, 326pp.
Record type:
Thesis
(Doctoral)
Abstract
Recently, the numbers of oligonucleotides approved to the market for medicines has increased1 and take an important place in new drugs for undruggable diseases. For the safety of the patients, the drug, as well as any impurities from the synthesis, must be characterised.
Oligonucleotides can be analysed by negative ion electrospray ionisation (ESI) mass spectrometry with collision-induced dissociation (CID) where different product ions characteristic of the sequence are obtained. To understand and interpret the data, an expert analyst is needed to reconstruct the full sequence. Depending on the complexity and the length of the oligonucleotide and impurities present, it can take days, weeks, or months to sequence them manually. This can directly impact the time for the drug to be approved to the market. Some software packages exist to help the interpretation of the data but are limited due to a lack of detailed understanding how oligonucleotides dissociate in the gas phase inside the mass spectrometer.
To predict the dissociation of oligonucleotides, it is important to understand how simple sequences dissociate. To probe that, a library of unmodified and modified 21 mer oligonucleotides (phosphorothioate backbone rather than phosphodiester backbone) were synthesised to understand the impact of the different nucleobases present in the sequences, their localisation, and the impact of the different phosphate backbones on the dissociation. This library was analysed by direct infusion and RP-IP-LC-MS to obtain different charge states that were dissociated (with or without isolation) by isCID and CID. After dissociation the different charge states give different information and sequence coverages due to their conformation in the gas phase. During the assignment of the product ions, IMS has been used to help identify the charge states of the diverse product ions. After assignment of the product ions, the data were compared for the phosphodiester sequences, the phosphorothioate sequences, and then between the phosphodiester and phosphorothioate sequences.
The results from the library by high resolution negative ion ESI and CID mass spectrometry in the gas phase has shown that the dissociation is driven by the proton affinity of the nucleobases, the secondary structure by Watson-Crick pairing, the number of each nucleobase present in the sequence, their position, and their neighbours, depending on the charge state dissociated. By analysing, understanding, and comparing the same sequences with and without phosphorothioate phosphate backbone, new rules and observations have been obtained which are affected by modifying the phosphate backbone where the proton affinity of the nucleobases is impacted.
The use of this library will help to improve the current software to be more accurate and to predict the dissociation of more complex sequences depending to the charge state dissociated. Furthermore, the observations and rules obtained here could be used to develop a database to be incorporated into machine learning to predict the dissociation of known oligonucleotide structures and then characterise unknown oligonucleotides automatically. It would help the pharmaceutical industry to develop and characterise therapeutic oligonucleotides quicker, as well as their impurities, used for undruggable diseases. This will affect the lives of millions of patients worldwide to have access to their medicine faster.
Text
Transforming Data to Understanding and Knowledge for the Sequencing of Therapeutic Oligonucleotides by Mass Spectrometry
- Version of Record
Restricted to Repository staff only until 20 August 2026.
Text
Final-thesis-submission-Examination-Mr-Fabien-Hannauer
Restricted to Repository staff only
More information
Published date: 2024
Keywords:
Mass Spectrometry, oligonucleotide, Tandem Mass Spectrometry
Identifiers
Local EPrints ID: 494213
URI: http://eprints.soton.ac.uk/id/eprint/494213
PURE UUID: 287cae27-6120-43ae-b65e-6d2fe996d7cc
Catalogue record
Date deposited: 30 Sep 2024 17:12
Last modified: 01 Oct 2024 02:03
Export record
Contributors
Author:
Fabien Hannauer
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics