Are skyline plot-based demographic estimates overly dependent on smoothing prior assumptions?
Are skyline plot-based demographic estimates overly dependent on smoothing prior assumptions?
In Bayesian phylogenetics, the coalescent process provides an informative framework for inferring dynamical changes in the effective size of a population from a sampled phylogeny (or tree) of its sequences. Popular coalescent inference methods such as the Bayesian Skyline Plot, Skyride and Skygrid all model this population size with a discontinuous, piecewise-constant likelihood but apply a smoothing prior to ensure that posterior population size estimates transition gradually with time. These prior distributions implicitly encode extra population size information that is not available from the observed coalescent tree (data). Here we present a novel statistic, Ω, to quantify and disaggregate the relative contributions of the coalescent data and prior assumptions to the resulting posterior estimate precision. Our statistic also measures the additional mutual information introduced by such priors. Using Ω we show that, because it is surprisingly easy to over-parametrise piecewise-constant population models, common smoothing priors can lead to overconfident and potentially misleading conclusions, even under robust experimental designs. We propose Ω as a useful tool for detecting when posterior estimate precision is overly reliant on prior choices.
Parag, Kris Varun
245f9c2d-66b3-4357-a893-d62f318d7a1c
Pybus, Oliver G.
5fa128e1-8eb8-4d38-b925-1d7869a07f99
Wu, Chieh-Hsi
ace630c6-2095-4ade-b657-241692f6b4d3
27 January 2020
Parag, Kris Varun
245f9c2d-66b3-4357-a893-d62f318d7a1c
Pybus, Oliver G.
5fa128e1-8eb8-4d38-b925-1d7869a07f99
Wu, Chieh-Hsi
ace630c6-2095-4ade-b657-241692f6b4d3
[Unknown type: UNSPECIFIED]
Abstract
In Bayesian phylogenetics, the coalescent process provides an informative framework for inferring dynamical changes in the effective size of a population from a sampled phylogeny (or tree) of its sequences. Popular coalescent inference methods such as the Bayesian Skyline Plot, Skyride and Skygrid all model this population size with a discontinuous, piecewise-constant likelihood but apply a smoothing prior to ensure that posterior population size estimates transition gradually with time. These prior distributions implicitly encode extra population size information that is not available from the observed coalescent tree (data). Here we present a novel statistic, Ω, to quantify and disaggregate the relative contributions of the coalescent data and prior assumptions to the resulting posterior estimate precision. Our statistic also measures the additional mutual information introduced by such priors. Using Ω we show that, because it is surprisingly easy to over-parametrise piecewise-constant population models, common smoothing priors can lead to overconfident and potentially misleading conclusions, even under robust experimental designs. We propose Ω as a useful tool for detecting when posterior estimate precision is overly reliant on prior choices.
This record has no associated files available for download.
More information
Accepted/In Press date: 27 January 2020
Published date: 27 January 2020
Additional Information:
This work has subsequently been published as Parag, K. V., Pybus, O. G., & Wu, C-H. (2022). Are skyline plot-based demographic estimates overly dependent on smoothing prior assumptions? Systematic Biology, 71(1), 121-138. https://doi.org/10.1093/sysbio/syab037
Identifiers
Local EPrints ID: 448617
URI: http://eprints.soton.ac.uk/id/eprint/448617
PURE UUID: 05520557-0ace-4c50-a5c7-2e50303bbbc1
Catalogue record
Date deposited: 28 Apr 2021 16:32
Last modified: 17 Mar 2024 04:00
Export record
Altmetrics
Contributors
Author:
Kris Varun Parag
Author:
Oliver G. Pybus
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics