The University of Southampton
University of Southampton Institutional Repository

Methods to identify novel disease genes and uplift diagnosis rates in rare diseases

Methods to identify novel disease genes and uplift diagnosis rates in rare diseases
Methods to identify novel disease genes and uplift diagnosis rates in rare diseases
Since the advent of next generation sequencing technologies, the ability to diagnose rare diseases has improved considerably. Yet despite advances, most rare diseases remain undiagnosed. In part, this is due to a demand for more efficient methods to interpret genomic sequencing data, in addition to the need to establish the phenotypic consequence of variants in genes not yet associated with disease. This thesis describes the development and testing of novel methods to improve diagnostic efficiency in patients with rare diseases, in addition to the discovery of novel disease-gene relationships. Herein describes the DeNovoLOEUF method, which identifies putative pathogenic de novo, loss-of-function variants in both known disease and putative disease genes. The gene-agnostic HiPPo protocol is further described, which prioritises variants identified in sequencing data. Finally, application of the GenePy dimensionality reduction algorithm to identify missed biallelic diagnoses is discussed. DeNovoLOEUF was applied in established disease genes to ~14,000 trios recruited to the 100,000 Genomes Project (100KGP). In total, 98% of all variants identified were proven diagnostic, including 39 new diagnoses missed by 100KGP. DeNovoLOEUF was then applied to novel genes to the same 100KGP cohort. A total of 18 putative disease genes were identified, whereby 12/18 (67%) of these genes have since been functionally validated. For the remaining 6 genes, case series are underway and two of these with supportive functional evidence are presented in this thesis: DDX17 (comprising 11 patients with de novo monoallelic variants and neurodevelopmental phenotypes, named Seaby-Ennis Syndrome); and HDLBP (comprising 7 patients with de novo monoallelic variants and neurodevelopmental phenotypes). Finally, application of the HiPPo protocol was demonstrated to be an effective, efficient, alternative method to interpret genomic data, capable of outperforming strategies used by the NHS Genomic Medicine Service (GMS). The GMS utilises gene panels to analyse sequence data, whereas HiPPo is a panel-agnostic method that prioritises variants using in silico metrics. HiPPo had a superior diagnostic rate per number of variant assessed when compared with the GMS (20% vs 3% respectively). HiPPo further identified all pathogenic variants reported by the GMS and identified an additional missed pathogenic variant. Data presented in this thesis demonstrate how novel methods applied to genomic sequencing data can efficiently enhance diagnosis rates for patients with rare diseases and identify new disease-gene relationships. In turn, these can improve patient outcomes by better elucidating mechanistic understanding of disease, identify novel therapeutic targets, and tailor treatments to specific diseases and individuals. To fully realise the potential of novel methods, additional research is needed. Future plans will involve the use of artificial intelligence to refine methods and models for improved clinical outcomes.
University of Southampton
Seaby, Eleanor
ec948f42-007c-4bd8-9dff-bb86278bf03f
Seaby, Eleanor Grace
f9011f96-bbc5-4364-970a-0f510489c539
Seaby, Eleanor
ec948f42-007c-4bd8-9dff-bb86278bf03f
Seaby, Eleanor Grace
f9011f96-bbc5-4364-970a-0f510489c539
Ennis, Sarah
7b57f188-9d91-4beb-b217-09856146f1e9
Baralle, Diana
faac16e5-7928-4801-9811-8b3a9ea4bb91

Seaby, Eleanor and Seaby, Eleanor Grace (2023) Methods to identify novel disease genes and uplift diagnosis rates in rare diseases. University of Southampton, Doctoral Thesis, 357pp.

Record type: Thesis (Doctoral)

Abstract

Since the advent of next generation sequencing technologies, the ability to diagnose rare diseases has improved considerably. Yet despite advances, most rare diseases remain undiagnosed. In part, this is due to a demand for more efficient methods to interpret genomic sequencing data, in addition to the need to establish the phenotypic consequence of variants in genes not yet associated with disease. This thesis describes the development and testing of novel methods to improve diagnostic efficiency in patients with rare diseases, in addition to the discovery of novel disease-gene relationships. Herein describes the DeNovoLOEUF method, which identifies putative pathogenic de novo, loss-of-function variants in both known disease and putative disease genes. The gene-agnostic HiPPo protocol is further described, which prioritises variants identified in sequencing data. Finally, application of the GenePy dimensionality reduction algorithm to identify missed biallelic diagnoses is discussed. DeNovoLOEUF was applied in established disease genes to ~14,000 trios recruited to the 100,000 Genomes Project (100KGP). In total, 98% of all variants identified were proven diagnostic, including 39 new diagnoses missed by 100KGP. DeNovoLOEUF was then applied to novel genes to the same 100KGP cohort. A total of 18 putative disease genes were identified, whereby 12/18 (67%) of these genes have since been functionally validated. For the remaining 6 genes, case series are underway and two of these with supportive functional evidence are presented in this thesis: DDX17 (comprising 11 patients with de novo monoallelic variants and neurodevelopmental phenotypes, named Seaby-Ennis Syndrome); and HDLBP (comprising 7 patients with de novo monoallelic variants and neurodevelopmental phenotypes). Finally, application of the HiPPo protocol was demonstrated to be an effective, efficient, alternative method to interpret genomic data, capable of outperforming strategies used by the NHS Genomic Medicine Service (GMS). The GMS utilises gene panels to analyse sequence data, whereas HiPPo is a panel-agnostic method that prioritises variants using in silico metrics. HiPPo had a superior diagnostic rate per number of variant assessed when compared with the GMS (20% vs 3% respectively). HiPPo further identified all pathogenic variants reported by the GMS and identified an additional missed pathogenic variant. Data presented in this thesis demonstrate how novel methods applied to genomic sequencing data can efficiently enhance diagnosis rates for patients with rare diseases and identify new disease-gene relationships. In turn, these can improve patient outcomes by better elucidating mechanistic understanding of disease, identify novel therapeutic targets, and tailor treatments to specific diseases and individuals. To fully realise the potential of novel methods, additional research is needed. Future plans will involve the use of artificial intelligence to refine methods and models for improved clinical outcomes.

Text
SEABY_thesis_FINAL_14Dec23_digitialversion - Version of Record
Available under License University of Southampton Thesis Licence.
Download (50MB)
Text
Final-thesis-submission-Examination-Dr-Eleanor-Seaby
Restricted to Repository staff only

More information

e-pub ahead of print date: 2023
Published date: 2023

Identifiers

Local EPrints ID: 485744
URI: http://eprints.soton.ac.uk/id/eprint/485744
PURE UUID: 78467521-c280-46c6-b500-8402ed98cbbe
ORCID for Eleanor Seaby: ORCID iD orcid.org/0000-0002-6814-8648
ORCID for Sarah Ennis: ORCID iD orcid.org/0000-0003-2648-0869
ORCID for Diana Baralle: ORCID iD orcid.org/0000-0003-3217-4833

Catalogue record

Date deposited: 18 Dec 2023 20:29
Last modified: 20 Mar 2024 03:01

Export record

Contributors

Author: Eleanor Seaby ORCID iD
Author: Eleanor Grace Seaby
Thesis advisor: Sarah Ennis ORCID iD
Thesis advisor: Diana Baralle ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×