The University of Southampton
University of Southampton Institutional Repository

A new principle for tuning-free Huber regression

A new principle for tuning-free Huber regression
A new principle for tuning-free Huber regression
The robustification parameter, which balances bias and robustness, has played a critical role in the construction of sub-Gaussian estimators for heavy-tailed and/or skewed data. Although it can be tuned by cross-validation in traditional practice, in large scale statistical problems such as high dimensional covariance matrix estimation and large scale multiple testing, the number of robustification parameters scales with the dimensionality so that cross-validation can be computationally prohibitive. In this paper, we propose a new data-driven principle to choose the robustification parameter for Huber-type sub-Gaussian estimators in three fundamental problems: mean estimation, linear regression, and sparse regression in high dimensions. Our proposal is guided by the non-asymptotic deviation analysis, and is conceptually different from cross-validation which relies on the mean squared error to assess the fit. Extensive numerical experiments and real data analysis further illustrate the efficacy of the proposed methods
1017-0405
Wang, Lili
b6fd1b8c-deaf-40a0-8fbd-ed7a1a8ccae1
Zheng, Chao
f3e2a919-4c02-4f5a-8de6-4c4de8ab6b60
Zhou, Wen
2430de12-12a8-4b74-b5f9-6e4c547f0837
Zhou, Wen-Xin
e1dc14bc-a81d-4aed-8250-fd4d072c1a46
Wang, Lili
b6fd1b8c-deaf-40a0-8fbd-ed7a1a8ccae1
Zheng, Chao
f3e2a919-4c02-4f5a-8de6-4c4de8ab6b60
Zhou, Wen
2430de12-12a8-4b74-b5f9-6e4c547f0837
Zhou, Wen-Xin
e1dc14bc-a81d-4aed-8250-fd4d072c1a46

Wang, Lili, Zheng, Chao, Zhou, Wen and Zhou, Wen-Xin (2020) A new principle for tuning-free Huber regression. Statistica Sinica. (In Press)

Record type: Article

Abstract

The robustification parameter, which balances bias and robustness, has played a critical role in the construction of sub-Gaussian estimators for heavy-tailed and/or skewed data. Although it can be tuned by cross-validation in traditional practice, in large scale statistical problems such as high dimensional covariance matrix estimation and large scale multiple testing, the number of robustification parameters scales with the dimensionality so that cross-validation can be computationally prohibitive. In this paper, we propose a new data-driven principle to choose the robustification parameter for Huber-type sub-Gaussian estimators in three fundamental problems: mean estimation, linear regression, and sparse regression in high dimensions. Our proposal is guided by the non-asymptotic deviation analysis, and is conceptually different from cross-validation which relies on the mean squared error to assess the fit. Extensive numerical experiments and real data analysis further illustrate the efficacy of the proposed methods

Text
SS-2019-0045_na - Accepted Manuscript
Download (3MB)

More information

Accepted/In Press date: 11 May 2020

Identifiers

Local EPrints ID: 441508
URI: http://eprints.soton.ac.uk/id/eprint/441508
ISSN: 1017-0405
PURE UUID: 11e599ea-ba23-4f1d-94f9-1fe1036d1c2f
ORCID for Chao Zheng: ORCID iD orcid.org/0000-0001-7943-6349

Catalogue record

Date deposited: 16 Jun 2020 16:31
Last modified: 17 Mar 2024 04:02

Export record

Contributors

Author: Lili Wang
Author: Chao Zheng ORCID iD
Author: Wen Zhou
Author: Wen-Xin Zhou

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×