New routes to phylogeography: a Bayesian structured coalescent approximation
New routes to phylogeography: a Bayesian structured coalescent approximation
Phylogeographic methods aim to infer migration trends and the history of sampled lineages from genetic data. Applications of phylogeography are broad, and in the context of pathogens include the reconstruction of transmission histories and the origin and emergence of outbreaks. Phylogeographic inference based on bottom-up population genetics models is computationally expensive, and as a result faster alternatives based on the evolution of discrete traits have become popular. In this paper, we show that inference of migration rates and root locations based on discrete trait models is extremely unreliable and sensitive to biased sampling. To address this problem, we introduce BASTA (BAyesian STructured coalescent Approximation), a new approach implemented in BEAST2 that combines the accuracy of methods based on the structured coalescent with the computational efficiency required to handle more than just few populations. We illustrate the potentially severe implications of poor model choice for phylogeographic analyses by investigating the zoonotic transmission of Ebola virus. Whereas the structured coalescent analysis correctly infers that successive human Ebola outbreaks have been seeded by a large unsampled non-human reservoir population, the discrete trait analysis implausibly concludes that undetected human-to-human transmission has allowed the virus to persist over the past four decades. As genomics takes on an increasingly prominent role informing the control and prevention of infectious diseases, it will be vital that phylogeographic inference provides robust insights into transmission history.
De Maio, Nicola
d675e711-e9b1-4f7f-bc16-b67438373692
Wu, Chieh-Hsi
ace630c6-2095-4ade-b657-241692f6b4d3
O’Reilly, Kathleen M
2d2eab27-8ae0-44dc-a284-f2b7f6858b02
Wilson, Daniel
4aa90f93-d50c-41e1-9988-35e445832ce0
12 August 2015
De Maio, Nicola
d675e711-e9b1-4f7f-bc16-b67438373692
Wu, Chieh-Hsi
ace630c6-2095-4ade-b657-241692f6b4d3
O’Reilly, Kathleen M
2d2eab27-8ae0-44dc-a284-f2b7f6858b02
Wilson, Daniel
4aa90f93-d50c-41e1-9988-35e445832ce0
De Maio, Nicola, Wu, Chieh-Hsi, O’Reilly, Kathleen M and Wilson, Daniel
(2015)
New routes to phylogeography: a Bayesian structured coalescent approximation.
PLoS Genetics, 11 (8), [e1005421].
(doi:10.1371/journal.pgen.1005421).
Abstract
Phylogeographic methods aim to infer migration trends and the history of sampled lineages from genetic data. Applications of phylogeography are broad, and in the context of pathogens include the reconstruction of transmission histories and the origin and emergence of outbreaks. Phylogeographic inference based on bottom-up population genetics models is computationally expensive, and as a result faster alternatives based on the evolution of discrete traits have become popular. In this paper, we show that inference of migration rates and root locations based on discrete trait models is extremely unreliable and sensitive to biased sampling. To address this problem, we introduce BASTA (BAyesian STructured coalescent Approximation), a new approach implemented in BEAST2 that combines the accuracy of methods based on the structured coalescent with the computational efficiency required to handle more than just few populations. We illustrate the potentially severe implications of poor model choice for phylogeographic analyses by investigating the zoonotic transmission of Ebola virus. Whereas the structured coalescent analysis correctly infers that successive human Ebola outbreaks have been seeded by a large unsampled non-human reservoir population, the discrete trait analysis implausibly concludes that undetected human-to-human transmission has allowed the virus to persist over the past four decades. As genomics takes on an increasingly prominent role informing the control and prevention of infectious diseases, it will be vital that phylogeographic inference provides robust insights into transmission history.
Other
journal.pgen.1005421
- Version of Record
More information
Accepted/In Press date: 5 July 2015
e-pub ahead of print date: 12 August 2015
Published date: 12 August 2015
Identifiers
Local EPrints ID: 437898
URI: http://eprints.soton.ac.uk/id/eprint/437898
ISSN: 1553-7390
PURE UUID: e01b073e-7ba8-4de0-a670-af85857bd314
Catalogue record
Date deposited: 24 Feb 2020 17:30
Last modified: 17 Mar 2024 04:00
Export record
Altmetrics
Contributors
Author:
Nicola De Maio
Author:
Kathleen M O’Reilly
Author:
Daniel Wilson
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics