Applying large scale metanalysis of transcriptomic data to uncover hyper-responsive genes and prediction via machine learning
Applying large scale metanalysis of transcriptomic data to uncover hyper-responsive genes and prediction via machine learning
With the increasing adoption of high-throughput transcriptomic technologies there has been efforts to leverage these pre-existing datasets to improve the biological interpretability of bulk transcriptomic data analyses. Researchers have observed that genes exhibit markedly different responsiveness to perturbation- i.e. a subset of genes are more likely to exhibit large changes to their expression. This thesis expands on this area by developing a novel approach to bulk transcriptomic data analysis which leverages pre-existing datasets. Additionally, this thesis also showcases novel approach to feature selection for scRNA-seq datasets which also leverages pre-existing transcriptomic data to improve the clustering of annotated scRNA-seq datasets. Finally, this thesis utilises machine learning models to identify the key genomic and transcript-based features of these genes that explain the differences in genes’ responsiveness to perturbation.
University of Southampton
Coke, Brandon
9e52fdaf-3c8c-4c2b-b2a9-21486dcfd0a5
2025
Coke, Brandon
9e52fdaf-3c8c-4c2b-b2a9-21486dcfd0a5
Ewing, Rob
022c5b04-da20-4e55-8088-44d0dc9935ae
Coke, Brandon
(2025)
Applying large scale metanalysis of transcriptomic data to uncover hyper-responsive genes and prediction via machine learning.
University of Southampton, Doctoral Thesis, 231pp.
Record type:
Thesis
(Doctoral)
Abstract
With the increasing adoption of high-throughput transcriptomic technologies there has been efforts to leverage these pre-existing datasets to improve the biological interpretability of bulk transcriptomic data analyses. Researchers have observed that genes exhibit markedly different responsiveness to perturbation- i.e. a subset of genes are more likely to exhibit large changes to their expression. This thesis expands on this area by developing a novel approach to bulk transcriptomic data analysis which leverages pre-existing datasets. Additionally, this thesis also showcases novel approach to feature selection for scRNA-seq datasets which also leverages pre-existing transcriptomic data to improve the clustering of annotated scRNA-seq datasets. Finally, this thesis utilises machine learning models to identify the key genomic and transcript-based features of these genes that explain the differences in genes’ responsiveness to perturbation.
Text
Brandon Coke Thesis
- Version of Record
Restricted to Repository staff only until 30 June 2026.
Text
Final-thesis-submission-Examination-Mr-Brandon-Coke
Restricted to Repository staff only
More information
Published date: 2025
Identifiers
Local EPrints ID: 501875
URI: http://eprints.soton.ac.uk/id/eprint/501875
PURE UUID: fe07095d-0a3e-46b2-b5fa-0af0d6e86a66
Catalogue record
Date deposited: 11 Jun 2025 18:01
Last modified: 11 Sep 2025 03:18
Export record
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics