The University of Southampton
University of Southampton Institutional Repository

Identification of genetic factors associated with myeloid neoplasms

Identification of genetic factors associated with myeloid neoplasms
Identification of genetic factors associated with myeloid neoplasms
Myeloid neoplasms are clonal haematopoietic disorders characterised by the abnormal proliferation of specific myeloid cell types. The first part of this thesis focuses on mastocytosis, a rare haematological neoplasm characterised by the uncontrolled proliferation of mast cells. To test the hypothesis that germline variants can alter the risk of developing mastocytosis, a two stage case-control genome-wide association study was conducted in five European populations with 1,035 KITD816V-positive cases and 17,960 controls. This analysis identified three genome-wide significant SNPs: rs4616402 (Pmeta=1.37×10-15, I2=4.2), rs4662380 (Pmeta=2.11×10-12, I2=0) and rs13077541 (Pmeta=2.10×10-9, I2=0). Expression and methylation quantitative trait loci analysis were used to identify candidate genes located near the SNPs, specifically CEBPA, TEX41 and TBL1XR1. Statistical analysis with available clinical data, showed that rs4616402 was associated with age at presentation (P = 0.009; beta = 4.41; n = 422) in patients with non-advanced disease. Additional focused analysis identified suggestive associations between mastocytosis and genetic variation at TERT, TPSAB1/TPSB2, and IL13. Finally, a gene-based analysis was performed using the summary statistics of the stage 1 meta-analysis and multiple regression which suggested that the VEGFC gene is also associated with mastocytosis. The findings described in this thesis demonstrate that multiple inherited common risk variants predispose to KITD816V positive mastocytosis and provide novel avenues for functional investigation. In the second part of this thesis, the genetics of somatically acquired uniparental disomy (aUPD) in myeloid malignancies was investigated. Several regions of recurrent aUPD have been identified in patients affected with haematological neoplasms, many of which harbour somatic mutations that drive clonal proliferation. Similar regions of aUPD have also been identified in apparently healthy individuals, especially the elderly, which confer a tenfold increased risk of developing haematological malignancies. Large-scale sequencing initiatives of individuals unselected for cancer therefore represent a valuable resource to identify novel regions of aUPD and the underlying somatic mutations which drive clonal haematopoiesis (CH). Whole-exome sequence (WES) data for 49,996 individuals from the UK biobank (mean age = 56.5 years) was used to develop an automated pipeline for identifying aUPD regions and a new scoring system (gg score) to select aUPD regions with high confidence for manual review. Precision and recall were used to evaluate the gg score. The recall (or sensitivity) showed that it correctly identifies 55% of the predicted aUPD regions, although the model can also produce false negatives. On the other hand, the score performed well in term of precision and indicated that 90% of the aUPD regions were correctly classified. The methodology was then applied to WES data from a Swedish Case-Control study of Schizophrenia consisting of 12,380 samples and with a mean age of 65. Genes targeting the aUPD regions identified in the Swedish cohort are known (MPL, 1p; TET2, 4q; EZH2, 7q; JAK2, 9p; FLT3, 13q; MEG3-DLK1, 14q). Regions of aUPD were screened for somatic mutation if they were overlapping in two or more samples. However, only JAK2V617F was confirmed in all five samples with UPD9p and new aUPD regions with unknown gene target were not identified. This work showed that the frequency of sample with aUPD regions identified by WES data is lower (0.2-0.3%) than expected (1-2%) and provides an estimate what is needed in term of sample size to detect aUPD regions from WES data.
University of Southampton
Galata, Gabriella
5f6ca6b1-964d-4f1c-88ee-0194d0370de4
Galata, Gabriella
5f6ca6b1-964d-4f1c-88ee-0194d0370de4
Tapper, William
9d5ddc92-a8dd-4c78-ac67-c5867b62724c

Galata, Gabriella (2022) Identification of genetic factors associated with myeloid neoplasms. University of Southampton, Doctoral Thesis, 243pp.

Record type: Thesis (Doctoral)

Abstract

Myeloid neoplasms are clonal haematopoietic disorders characterised by the abnormal proliferation of specific myeloid cell types. The first part of this thesis focuses on mastocytosis, a rare haematological neoplasm characterised by the uncontrolled proliferation of mast cells. To test the hypothesis that germline variants can alter the risk of developing mastocytosis, a two stage case-control genome-wide association study was conducted in five European populations with 1,035 KITD816V-positive cases and 17,960 controls. This analysis identified three genome-wide significant SNPs: rs4616402 (Pmeta=1.37×10-15, I2=4.2), rs4662380 (Pmeta=2.11×10-12, I2=0) and rs13077541 (Pmeta=2.10×10-9, I2=0). Expression and methylation quantitative trait loci analysis were used to identify candidate genes located near the SNPs, specifically CEBPA, TEX41 and TBL1XR1. Statistical analysis with available clinical data, showed that rs4616402 was associated with age at presentation (P = 0.009; beta = 4.41; n = 422) in patients with non-advanced disease. Additional focused analysis identified suggestive associations between mastocytosis and genetic variation at TERT, TPSAB1/TPSB2, and IL13. Finally, a gene-based analysis was performed using the summary statistics of the stage 1 meta-analysis and multiple regression which suggested that the VEGFC gene is also associated with mastocytosis. The findings described in this thesis demonstrate that multiple inherited common risk variants predispose to KITD816V positive mastocytosis and provide novel avenues for functional investigation. In the second part of this thesis, the genetics of somatically acquired uniparental disomy (aUPD) in myeloid malignancies was investigated. Several regions of recurrent aUPD have been identified in patients affected with haematological neoplasms, many of which harbour somatic mutations that drive clonal proliferation. Similar regions of aUPD have also been identified in apparently healthy individuals, especially the elderly, which confer a tenfold increased risk of developing haematological malignancies. Large-scale sequencing initiatives of individuals unselected for cancer therefore represent a valuable resource to identify novel regions of aUPD and the underlying somatic mutations which drive clonal haematopoiesis (CH). Whole-exome sequence (WES) data for 49,996 individuals from the UK biobank (mean age = 56.5 years) was used to develop an automated pipeline for identifying aUPD regions and a new scoring system (gg score) to select aUPD regions with high confidence for manual review. Precision and recall were used to evaluate the gg score. The recall (or sensitivity) showed that it correctly identifies 55% of the predicted aUPD regions, although the model can also produce false negatives. On the other hand, the score performed well in term of precision and indicated that 90% of the aUPD regions were correctly classified. The methodology was then applied to WES data from a Swedish Case-Control study of Schizophrenia consisting of 12,380 samples and with a mean age of 65. Genes targeting the aUPD regions identified in the Swedish cohort are known (MPL, 1p; TET2, 4q; EZH2, 7q; JAK2, 9p; FLT3, 13q; MEG3-DLK1, 14q). Regions of aUPD were screened for somatic mutation if they were overlapping in two or more samples. However, only JAK2V617F was confirmed in all five samples with UPD9p and new aUPD regions with unknown gene target were not identified. This work showed that the frequency of sample with aUPD regions identified by WES data is lower (0.2-0.3%) than expected (1-2%) and provides an estimate what is needed in term of sample size to detect aUPD regions from WES data.

Text
Identification of genetic factors associated with myeloid neoplasms - Version of Record
Available under License University of Southampton Thesis Licence.
Download (20MB)
Text
29057485_Galata_Permission to deposit thesis - form RJP
Restricted to Repository staff only

More information

Submitted date: June 2021
Published date: 30 June 2022

Identifiers

Local EPrints ID: 475137
URI: http://eprints.soton.ac.uk/id/eprint/475137
PURE UUID: 24712782-7cf1-4122-8899-40acdc75e484
ORCID for William Tapper: ORCID iD orcid.org/0000-0002-5896-1889

Catalogue record

Date deposited: 10 Mar 2023 17:42
Last modified: 17 Mar 2024 02:49

Export record

Altmetrics

Contributors

Author: Gabriella Galata
Thesis advisor: William Tapper ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×