Data from: Are skyline plot-based demographic estimates overly dependent on smoothing prior assumptions?
Data from: Are skyline plot-based demographic estimates overly dependent on smoothing prior assumptions?
This Matlab code was used to produce figures and results for the manuscript: "Are skyline plot-based demographic estimates overly dependent on smoothing prior assumptions?" The main code can be used to compute the omega statistic presented in this paper. Key functions/scripts are: robustDecayTry.m: computes a p-correction for GMRF priors to omega compLowerBndsDecay.m: computes the bound on omega^2, used for model rejection modSelCurve.m, modSel_via_m.m, maxpModSel.m and omegaModelSelect.m: use to test model rejection algorithms under various conditions robustDecay.m and testEllip.m: looks at how Fisher information from prior influences the omega statistic and looks at uncertainty ellipses All files with the word 'King': computes omega statistic for Kingman coalescent *Update: now includes xmls for the empirical-based simulations for the bison and HCV examples in the revised Fig 5 and 6 of the main text.,In Bayesian phylogenetics, the coalescent process provides an informative framework for inferring changes in the effective size of a population from a phylogeny (or tree) of sequences sampled from that population. Popular coalescent inference approaches such as the Bayesian Skyline Plot, Skyride and Skygrid all model these population size changes with a discontinuous, piecewise-constant function but then apply a smoothing prior to ensure that their posterior population size estimates transition gradually with time. These prior distributions implicitly encode extra population size information that is not available from the observed coalescent data i.e. the tree. Here we present a novel statistic, Ω, to quantify and disaggregate the relative contributions of the coalescent data and prior assumptions to the resulting posterior estimate precision. Our statistic also measures the additional mutual information introduced by such priors. Using Ω we show that, because it is surprisingly easy to over-parametrise piecewise-constant population models, common smoothing priors can lead to overconfident and potentially misleading inference, even under robust experimental designs. We propose Ω as a useful tool for detecting when posterior estimate precision is overly reliant on prior choices.
Parag, Kris Varun
245f9c2d-66b3-4357-a893-d62f318d7a1c
Pybus, Oliver
5fa128e1-8eb8-4d38-b925-1d7869a07f99
Wu, Chieh-Hsi
ace630c6-2095-4ade-b657-241692f6b4d3
Parag, Kris Varun
245f9c2d-66b3-4357-a893-d62f318d7a1c
Pybus, Oliver
5fa128e1-8eb8-4d38-b925-1d7869a07f99
Wu, Chieh-Hsi
ace630c6-2095-4ade-b657-241692f6b4d3
(2020)
Data from: Are skyline plot-based demographic estimates overly dependent on smoothing prior assumptions?
DRYAD
doi:10.5061/dryad.1jwstqjs2
[Dataset]
Abstract
This Matlab code was used to produce figures and results for the manuscript: "Are skyline plot-based demographic estimates overly dependent on smoothing prior assumptions?" The main code can be used to compute the omega statistic presented in this paper. Key functions/scripts are: robustDecayTry.m: computes a p-correction for GMRF priors to omega compLowerBndsDecay.m: computes the bound on omega^2, used for model rejection modSelCurve.m, modSel_via_m.m, maxpModSel.m and omegaModelSelect.m: use to test model rejection algorithms under various conditions robustDecay.m and testEllip.m: looks at how Fisher information from prior influences the omega statistic and looks at uncertainty ellipses All files with the word 'King': computes omega statistic for Kingman coalescent *Update: now includes xmls for the empirical-based simulations for the bison and HCV examples in the revised Fig 5 and 6 of the main text.,In Bayesian phylogenetics, the coalescent process provides an informative framework for inferring changes in the effective size of a population from a phylogeny (or tree) of sequences sampled from that population. Popular coalescent inference approaches such as the Bayesian Skyline Plot, Skyride and Skygrid all model these population size changes with a discontinuous, piecewise-constant function but then apply a smoothing prior to ensure that their posterior population size estimates transition gradually with time. These prior distributions implicitly encode extra population size information that is not available from the observed coalescent data i.e. the tree. Here we present a novel statistic, Ω, to quantify and disaggregate the relative contributions of the coalescent data and prior assumptions to the resulting posterior estimate precision. Our statistic also measures the additional mutual information introduced by such priors. Using Ω we show that, because it is surprisingly easy to over-parametrise piecewise-constant population models, common smoothing priors can lead to overconfident and potentially misleading inference, even under robust experimental designs. We propose Ω as a useful tool for detecting when posterior estimate precision is overly reliant on prior choices.
This record has no associated files available for download.
More information
Published date: 2020
Identifiers
Local EPrints ID: 448612
URI: http://eprints.soton.ac.uk/id/eprint/448612
PURE UUID: f970f432-0d9a-42fd-a69f-1556b2677ff1
Catalogue record
Date deposited: 28 Apr 2021 16:31
Last modified: 06 May 2023 02:01
Export record
Altmetrics
Contributors
Contributor:
Kris Varun Parag
Contributor:
Oliver Pybus
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics