Development and application of powerful methods for identifying selective sweeps
Development and application of powerful methods for identifying selective sweeps
Identifying regions of the genome under selection is an ongoing effort in population genetics and quantitative biology. Selection creates signals within the genome that can be detected using a variety of methods. One of the major obstacles is the effect of variable recombination rates that create regions of the genome containing highly correlated variants, confounding methods that rely on correlations as indicators of selection. This thesis is mainly focussed on developing the Zα method, which is a statistic that uses linkage disequilibrium patterns to identify sweeps while also integrating the recombination rate.
As recombination rates are an important confounder, and a key component of the Zα method, recombination rates in human populations were studied further. The aim was to see if populations with different ancestral backgrounds have different recombination rates, and if so, at what scale. This work shows that different human populations have similar recombination patterns at the wide scale, but at the fine scale they can be quite different. This result means that individual recombination maps are required for each population when using the Zα method.
To easily and efficiently generate the Zα statistics, an R package was developed, published, and made freely available. R is an open source programming language, which increases reproducibility, transparency and reliability of the method and any results generated using it. Finally, the statistic was applied to the genome of the domestic dog. Firstly, the recombination map was generated, and then the new R package was used to apply the Zα statistics including adjusting for the recombination rate. This study identified candidate regions for selection in the dog genome, both previously published and novel to this study.
University of Southampton
Horscroft, Clare
6ed6a58f-5fe9-4be0-a92e-bee5ea43aa8c
January 2022
Horscroft, Clare
6ed6a58f-5fe9-4be0-a92e-bee5ea43aa8c
Collins, Andrew
7daa83eb-0b21-43b2-af1a-e38fb36e2a64
Pengelly, Reuben
af97c0c1-b568-415c-9f59-1823b65be76d
Ennis, Sarah
7b57f188-9d91-4beb-b217-09856146f1e9
Sluckin, Tim
8dbb6b08-7034-4ae2-aa65-6b80072202f6
Horscroft, Clare
(2022)
Development and application of powerful methods for identifying selective sweeps.
University of Southampton, Doctoral Thesis, 346pp.
Record type:
Thesis
(Doctoral)
Abstract
Identifying regions of the genome under selection is an ongoing effort in population genetics and quantitative biology. Selection creates signals within the genome that can be detected using a variety of methods. One of the major obstacles is the effect of variable recombination rates that create regions of the genome containing highly correlated variants, confounding methods that rely on correlations as indicators of selection. This thesis is mainly focussed on developing the Zα method, which is a statistic that uses linkage disequilibrium patterns to identify sweeps while also integrating the recombination rate.
As recombination rates are an important confounder, and a key component of the Zα method, recombination rates in human populations were studied further. The aim was to see if populations with different ancestral backgrounds have different recombination rates, and if so, at what scale. This work shows that different human populations have similar recombination patterns at the wide scale, but at the fine scale they can be quite different. This result means that individual recombination maps are required for each population when using the Zα method.
To easily and efficiently generate the Zα statistics, an R package was developed, published, and made freely available. R is an open source programming language, which increases reproducibility, transparency and reliability of the method and any results generated using it. Finally, the statistic was applied to the genome of the domestic dog. Firstly, the recombination map was generated, and then the new R package was used to apply the Zα statistics including adjusting for the recombination rate. This study identified candidate regions for selection in the dog genome, both previously published and novel to this study.
Text
Development and Application of Powerful Methods for Identifying Selective Sweeps
- Version of Record
Text
Permission to deposit thesis - form - CHAC_TAN
Restricted to Repository staff only
More information
Published date: January 2022
Identifiers
Local EPrints ID: 475278
URI: http://eprints.soton.ac.uk/id/eprint/475278
PURE UUID: 09e0ad3e-c2cf-41f9-9342-1e2af1761765
Catalogue record
Date deposited: 14 Mar 2023 18:01
Last modified: 17 Mar 2024 03:33
Export record
Contributors
Author:
Clare Horscroft
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics