The University of Southampton
University of Southampton Institutional Repository

Are skyline plot-based demographic estimates overly dependent on smoothing prior assumptions?

Are skyline plot-based demographic estimates overly dependent on smoothing prior assumptions?
Are skyline plot-based demographic estimates overly dependent on smoothing prior assumptions?
In Bayesian phylogenetics, the coalescent process provides an informative framework for inferring changes in the effective size of a population from a phylogeny (or tree) of sequences sampled from that population. Popular coalescent inference approaches such as the Bayesian Skyline Plot, Skyride, and Skygrid all model these population size changes with a discontinuous, piecewise-constant function but then apply a smoothing prior to ensure that their posterior population size estimates transition gradually with time. These prior distributions implicitly encode extra population size information that is not available from the observed coalescent data or tree. Here, we present a novel statistic, Ω⁠, to quantify and disaggregate the relative contributions of the coalescent data and prior assumptions to the resulting posterior estimate precision. Our statistic also measures the additional mutual information introduced by such priors. Using Ω we show that, because it is surprisingly easy to overparametrize piecewise-constant population models, common smoothing priors can lead to overconfident and potentially misleading inference, even under robust experimental designs. We propose Ω as a useful tool for detecting when effective population size estimates are overly reliant on prior assumptions and for improving quantification of the uncertainty in those estimates.[Coalescent processes; effective population size; information theory; phylodynamics; prior assumptions; skyline plots.]
1063-5157
121-138
Parag, Kris Varun
245f9c2d-66b3-4357-a893-d62f318d7a1c
Pybus, Oliver G.
5fa128e1-8eb8-4d38-b925-1d7869a07f99
Wu, Chieh-Hsi
ace630c6-2095-4ade-b657-241692f6b4d3
Parag, Kris Varun
245f9c2d-66b3-4357-a893-d62f318d7a1c
Pybus, Oliver G.
5fa128e1-8eb8-4d38-b925-1d7869a07f99
Wu, Chieh-Hsi
ace630c6-2095-4ade-b657-241692f6b4d3

Parag, Kris Varun, Pybus, Oliver G. and Wu, Chieh-Hsi (2022) Are skyline plot-based demographic estimates overly dependent on smoothing prior assumptions? Systematic Biology, 71 (1), 121-138. (doi:10.1093/sysbio/syab037).

Record type: Article

Abstract

In Bayesian phylogenetics, the coalescent process provides an informative framework for inferring changes in the effective size of a population from a phylogeny (or tree) of sequences sampled from that population. Popular coalescent inference approaches such as the Bayesian Skyline Plot, Skyride, and Skygrid all model these population size changes with a discontinuous, piecewise-constant function but then apply a smoothing prior to ensure that their posterior population size estimates transition gradually with time. These prior distributions implicitly encode extra population size information that is not available from the observed coalescent data or tree. Here, we present a novel statistic, Ω⁠, to quantify and disaggregate the relative contributions of the coalescent data and prior assumptions to the resulting posterior estimate precision. Our statistic also measures the additional mutual information introduced by such priors. Using Ω we show that, because it is surprisingly easy to overparametrize piecewise-constant population models, common smoothing priors can lead to overconfident and potentially misleading inference, even under robust experimental designs. We propose Ω as a useful tool for detecting when effective population size estimates are overly reliant on prior assumptions and for improving quantification of the uncertainty in those estimates.[Coalescent processes; effective population size; information theory; phylodynamics; prior assumptions; skyline plots.]

Text
syab037 - Version of Record
Available under License Creative Commons Attribution.
Download (1MB)

More information

Accepted/In Press date: 8 May 2021
e-pub ahead of print date: 13 May 2021
Published date: 1 January 2022

Identifiers

Local EPrints ID: 451024
URI: http://eprints.soton.ac.uk/id/eprint/451024
ISSN: 1063-5157
PURE UUID: dba5968f-bf2e-46f6-b9c9-435d3497443b
ORCID for Chieh-Hsi Wu: ORCID iD orcid.org/0000-0001-9386-725X

Catalogue record

Date deposited: 03 Sep 2021 16:31
Last modified: 17 Mar 2024 04:00

Export record

Altmetrics

Contributors

Author: Kris Varun Parag
Author: Oliver G. Pybus
Author: Chieh-Hsi Wu ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×