A new principle for tuning-free Huber regression
A new principle for tuning-free Huber regression
The robustification parameter, which balances bias and robustness, has played a critical role in the construction of sub-Gaussian estimators for heavy-tailed and/or skewed data. Although it can be tuned by cross-validation in traditional practice, in large scale statistical problems such as high dimensional covariance matrix estimation and large scale multiple testing, the number of robustification parameters scales with the dimensionality so that cross-validation can be computationally prohibitive. In this paper, we propose a new data-driven principle to choose the robustification parameter for Huber-type sub-Gaussian estimators in three fundamental problems: mean estimation, linear regression, and sparse regression in high dimensions. Our proposal is guided by the non-asymptotic deviation analysis, and is conceptually different from cross-validation which relies on the mean squared error to assess the fit. Extensive numerical experiments and real data analysis further illustrate the efficacy of the proposed methods
Wang, Lili
b6fd1b8c-deaf-40a0-8fbd-ed7a1a8ccae1
Zheng, Chao
f3e2a919-4c02-4f5a-8de6-4c4de8ab6b60
Zhou, Wen
2430de12-12a8-4b74-b5f9-6e4c547f0837
Zhou, Wen-Xin
e1dc14bc-a81d-4aed-8250-fd4d072c1a46
Wang, Lili
b6fd1b8c-deaf-40a0-8fbd-ed7a1a8ccae1
Zheng, Chao
f3e2a919-4c02-4f5a-8de6-4c4de8ab6b60
Zhou, Wen
2430de12-12a8-4b74-b5f9-6e4c547f0837
Zhou, Wen-Xin
e1dc14bc-a81d-4aed-8250-fd4d072c1a46
Wang, Lili, Zheng, Chao, Zhou, Wen and Zhou, Wen-Xin
(2020)
A new principle for tuning-free Huber regression.
Statistica Sinica.
(In Press)
Abstract
The robustification parameter, which balances bias and robustness, has played a critical role in the construction of sub-Gaussian estimators for heavy-tailed and/or skewed data. Although it can be tuned by cross-validation in traditional practice, in large scale statistical problems such as high dimensional covariance matrix estimation and large scale multiple testing, the number of robustification parameters scales with the dimensionality so that cross-validation can be computationally prohibitive. In this paper, we propose a new data-driven principle to choose the robustification parameter for Huber-type sub-Gaussian estimators in three fundamental problems: mean estimation, linear regression, and sparse regression in high dimensions. Our proposal is guided by the non-asymptotic deviation analysis, and is conceptually different from cross-validation which relies on the mean squared error to assess the fit. Extensive numerical experiments and real data analysis further illustrate the efficacy of the proposed methods
Text
SS-2019-0045_na
- Accepted Manuscript
More information
Accepted/In Press date: 11 May 2020
Identifiers
Local EPrints ID: 441508
URI: http://eprints.soton.ac.uk/id/eprint/441508
ISSN: 1017-0405
PURE UUID: 11e599ea-ba23-4f1d-94f9-1fe1036d1c2f
Catalogue record
Date deposited: 16 Jun 2020 16:31
Last modified: 17 Mar 2024 04:02
Export record
Contributors
Author:
Lili Wang
Author:
Wen Zhou
Author:
Wen-Xin Zhou
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics