Next-generation sequencing analyses in human disease and population genomics
Next-generation sequencing analyses in human disease and population genomics
Application of next-generation sequencing (NGS) in clinical diagnosis has enabled the efficient analysis of diverse genetic disorders. Rapid growth in the number of human genomes sequenced, underpinning a developing understanding of the disease-gene relationship. The high-throughput nature of NGS technology necessitates the need for a robust analytical framework for efficient and accurate genetic diagnosis. The higher degree of genetic variation identified in NGS applications often confounds the molecular diagnosis, and requires an enhanced strategy for the identification of causal variants. This thesis explores diverse applications of whole-exome and whole-genome sequencing at both the individual and population level for delineation of the human disease genome.
Design, implementation and benchmarking of efficient pipelines for analysing whole-exome and whole-genome sequencing data is first explored. Next, the diagnostic utility of the pipelines is examined in a range of rare disorders. This includes whole exome analysis of patients with hereditary nephrolithiasis, whole-exome and whole-genome analysis of a patient with severe skeletal dysplasia and targeted gene panel sequencing in a cohort of patients with syndromic cleft lip/palate (CLP). Through analyses of these cases, advantages and limitations of NGS analysis for establishing the molecular diagnosis in rare disorders are demonstrated. A novel method for ranking variants in the presence of phenotypic and genetic heterogeneity is introduced, and its diagnostic utility explored across syndromic CLP patients. While the application of variant-level attributes such as pathogenicity and conservation scores greatly facilitate molecular diagnosis, full resolution of the genetic architecture underlying disease genome depends on identification of factors that dictate the spatial distribution of pathogenic mutations across the genome. The nonrandom distribution of variants across the genome is the outcome of the complex interplay between selection, recombination and mutation which is reflected in genome-wide linkage disequilibrium (LD) patterns.
The final section of the thesis explores the possibility of delineating human disease genome from fine-scale LD maps in Sub-Saharan African populations (SSA). Extended population history in the SSA populations enables an unprecedented resolution for characterisation of LD patterns at sub-genic levels. LD maps constructed according to the Mal´ecot-Morton model from the whole-genome sequence data of 295 individuals from major SSA populations correlates closely with the proposed models of Bantu expansion across Africa. Furthermore, the relationship between gene-ontology groups, gene essentiality and gene-age with the extent of LD is investigated, and a model for identification of the association between the LD extent and gene-group assignment is proposed.
Overall, this thesis demonstrates many applications of NGS technology and highlights the common limitations involved in the analysis and interpretation of variants revealed from high throughput NGS analysis.
University of Southampton
Forooshani, Mohammad Reza Jabal Ameli
d533e702-7a6b-4f2d-8947-352ea1dd769b
October 2018
Forooshani, Mohammad Reza Jabal Ameli
d533e702-7a6b-4f2d-8947-352ea1dd769b
Ennis, Sarah
7b57f188-9d91-4beb-b217-09856146f1e9
Collins, Andrew
7daa83eb-0b21-43b2-af1a-e38fb36e2a64
Tapper, William
9d5ddc92-a8dd-4c78-ac67-c5867b62724c
Forooshani, Mohammad Reza Jabal Ameli
(2018)
Next-generation sequencing analyses in human disease and population genomics.
University of Southampton, Doctoral Thesis, 302pp.
Record type:
Thesis
(Doctoral)
Abstract
Application of next-generation sequencing (NGS) in clinical diagnosis has enabled the efficient analysis of diverse genetic disorders. Rapid growth in the number of human genomes sequenced, underpinning a developing understanding of the disease-gene relationship. The high-throughput nature of NGS technology necessitates the need for a robust analytical framework for efficient and accurate genetic diagnosis. The higher degree of genetic variation identified in NGS applications often confounds the molecular diagnosis, and requires an enhanced strategy for the identification of causal variants. This thesis explores diverse applications of whole-exome and whole-genome sequencing at both the individual and population level for delineation of the human disease genome.
Design, implementation and benchmarking of efficient pipelines for analysing whole-exome and whole-genome sequencing data is first explored. Next, the diagnostic utility of the pipelines is examined in a range of rare disorders. This includes whole exome analysis of patients with hereditary nephrolithiasis, whole-exome and whole-genome analysis of a patient with severe skeletal dysplasia and targeted gene panel sequencing in a cohort of patients with syndromic cleft lip/palate (CLP). Through analyses of these cases, advantages and limitations of NGS analysis for establishing the molecular diagnosis in rare disorders are demonstrated. A novel method for ranking variants in the presence of phenotypic and genetic heterogeneity is introduced, and its diagnostic utility explored across syndromic CLP patients. While the application of variant-level attributes such as pathogenicity and conservation scores greatly facilitate molecular diagnosis, full resolution of the genetic architecture underlying disease genome depends on identification of factors that dictate the spatial distribution of pathogenic mutations across the genome. The nonrandom distribution of variants across the genome is the outcome of the complex interplay between selection, recombination and mutation which is reflected in genome-wide linkage disequilibrium (LD) patterns.
The final section of the thesis explores the possibility of delineating human disease genome from fine-scale LD maps in Sub-Saharan African populations (SSA). Extended population history in the SSA populations enables an unprecedented resolution for characterisation of LD patterns at sub-genic levels. LD maps constructed according to the Mal´ecot-Morton model from the whole-genome sequence data of 295 individuals from major SSA populations correlates closely with the proposed models of Bantu expansion across Africa. Furthermore, the relationship between gene-ontology groups, gene essentiality and gene-age with the extent of LD is investigated, and a model for identification of the association between the LD extent and gene-group assignment is proposed.
Overall, this thesis demonstrates many applications of NGS technology and highlights the common limitations involved in the analysis and interpretation of variants revealed from high throughput NGS analysis.
Text
Reza Jabalameli ePrint
- Version of Record
More information
Published date: October 2018
Identifiers
Local EPrints ID: 430688
URI: http://eprints.soton.ac.uk/id/eprint/430688
PURE UUID: ac1bc631-c08e-448e-9e18-fd29136e5c24
Catalogue record
Date deposited: 08 May 2019 16:30
Last modified: 16 Mar 2024 03:07
Export record
Contributors
Author:
Mohammad Reza Jabal Ameli Forooshani
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics