Model selection confidence sets by likelihood ratio testing
Model selection confidence sets by likelihood ratio testing
The traditional activity of model selection aims at discovering a single model superior to other candidate models. In the presence of pronounced noise, however, multiple models are often found to explain the same data equally well. To resolve this model selection ambiguity, we introduce the general approach of model selection confidence sets (MSCSs) based on likelihood ratio testing. A MSCS is defined as a list of models statistically indistinguishable from the true model at a user-specified level of confidence, which extends the familiar notion of confidence intervals to the model-selection framework. Our approach guarantees asymptotically correct coverage probability of the true model when both sample size and model dimension increase. We derive conditions under which the MSCS contains all the relevant information about the true model structure. In addition, we propose natural statistics based on the MSCS to measure importance of variables in a principled way that accounts for the overall model uncertainty. When the space of feasible models is large, MSCS is implemented by an adaptive stochastic search algorithm which samples MSCS models with high probability. The MSCS methodology is illustrated through numerical experiments on synthetic and real data examples.
827-851
Zheng, Chao
f3e2a919-4c02-4f5a-8de6-4c4de8ab6b60
Ferrari, Davide
b061fda3-174e-409f-8130-b09eca7e4c93
Yang, Yuhong
cc23782b-e4bb-4fe0-9419-490595903e0d
1 April 2019
Zheng, Chao
f3e2a919-4c02-4f5a-8de6-4c4de8ab6b60
Ferrari, Davide
b061fda3-174e-409f-8130-b09eca7e4c93
Yang, Yuhong
cc23782b-e4bb-4fe0-9419-490595903e0d
Zheng, Chao, Ferrari, Davide and Yang, Yuhong
(2019)
Model selection confidence sets by likelihood ratio testing.
Statistica Sinica, 29, .
(doi:10.5705/ss.202017.0006).
Abstract
The traditional activity of model selection aims at discovering a single model superior to other candidate models. In the presence of pronounced noise, however, multiple models are often found to explain the same data equally well. To resolve this model selection ambiguity, we introduce the general approach of model selection confidence sets (MSCSs) based on likelihood ratio testing. A MSCS is defined as a list of models statistically indistinguishable from the true model at a user-specified level of confidence, which extends the familiar notion of confidence intervals to the model-selection framework. Our approach guarantees asymptotically correct coverage probability of the true model when both sample size and model dimension increase. We derive conditions under which the MSCS contains all the relevant information about the true model structure. In addition, we propose natural statistics based on the MSCS to measure importance of variables in a principled way that accounts for the overall model uncertainty. When the space of feasible models is large, MSCS is implemented by an adaptive stochastic search algorithm which samples MSCS models with high probability. The MSCS methodology is illustrated through numerical experiments on synthetic and real data examples.
Text
MSCS_final
- Accepted Manuscript
More information
Accepted/In Press date: 1 September 2017
e-pub ahead of print date: 1 April 2019
Published date: 1 April 2019
Identifiers
Local EPrints ID: 441357
URI: http://eprints.soton.ac.uk/id/eprint/441357
ISSN: 1017-0405
PURE UUID: 0b16c5d8-a751-4665-aaa4-ee34100ca6de
Catalogue record
Date deposited: 10 Jun 2020 16:31
Last modified: 17 Mar 2024 04:02
Export record
Altmetrics
Contributors
Author:
Davide Ferrari
Author:
Yuhong Yang
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics