Robust estimation for small domains in business surveys
Robust estimation for small domains in business surveys
Small area (or small domain) estimation is still rarely applied in business statistics, because of challenges arising from the skewness and variability of variables such as turnover. We examine a range of small area estimation methods as the basis for estimating the activity of industries within the retail sector in the Netherlands. We use tax register data and a sampling procedure which replicates the sampling for the retail sector of Statistics Netherlands’ Structural Business Survey as a basis for investigating the properties of small area estimators. In particular, we consider the use of the EBLUP under a random effects model and variations of the EBLUP derived under (a) a random effects model that includes a complex specification for the level 1 variance and (b) a random effects model that is fitted by using the survey weights. Although accounting for the survey weights in estimation is important, the impact of influential data points remains the main challenge in this case. The paper further explores the use of outlier robust estimators in business surveys, in particular a robust version of the EBLUP, M-regression based synthetic estimators, and M-quantile small area estimators. The latter family of small area estimators includes robust projective (without and with survey weights) and robust predictive versions. M-quantile methods have the lowest empirical mean squared error and are substantially better than direct estimators, though there is an open question about how to choose the tuning constant for bias adjustment in practice. The paper makes a further contribution by exploring a doubly robust approach comprising the use of survey weights in conjunction with outlier robust methods in small area estimation.
M-quantile regression, bias adjustment, outliers, small area estimation
312-334
Smith, Paul A.
a2548525-4f99-4baf-a4d0-2b216cce059c
Bocci, Chiara
379e761a-a313-493d-a75a-ef184be59ce5
Tzavidis, Nikolaos
431ec55d-c147-466d-9c65-0f377b0c1f6a
Krieg, Sabine
c6e8a3f9-9668-44da-b911-252cb7c4c60d
Smeets, Marc JE
b76a60fa-ae8d-4b04-9c3d-2a5c5a2c51d7
8 March 2021
Smith, Paul A.
a2548525-4f99-4baf-a4d0-2b216cce059c
Bocci, Chiara
379e761a-a313-493d-a75a-ef184be59ce5
Tzavidis, Nikolaos
431ec55d-c147-466d-9c65-0f377b0c1f6a
Krieg, Sabine
c6e8a3f9-9668-44da-b911-252cb7c4c60d
Smeets, Marc JE
b76a60fa-ae8d-4b04-9c3d-2a5c5a2c51d7
Smith, Paul A., Bocci, Chiara, Tzavidis, Nikolaos, Krieg, Sabine and Smeets, Marc JE
(2021)
Robust estimation for small domains in business surveys.
Journal of the Royal Statistical Society, Series C (Applied Statistics), 70 (2), .
(doi:10.1111/rssc.12460).
Abstract
Small area (or small domain) estimation is still rarely applied in business statistics, because of challenges arising from the skewness and variability of variables such as turnover. We examine a range of small area estimation methods as the basis for estimating the activity of industries within the retail sector in the Netherlands. We use tax register data and a sampling procedure which replicates the sampling for the retail sector of Statistics Netherlands’ Structural Business Survey as a basis for investigating the properties of small area estimators. In particular, we consider the use of the EBLUP under a random effects model and variations of the EBLUP derived under (a) a random effects model that includes a complex specification for the level 1 variance and (b) a random effects model that is fitted by using the survey weights. Although accounting for the survey weights in estimation is important, the impact of influential data points remains the main challenge in this case. The paper further explores the use of outlier robust estimators in business surveys, in particular a robust version of the EBLUP, M-regression based synthetic estimators, and M-quantile small area estimators. The latter family of small area estimators includes robust projective (without and with survey weights) and robust predictive versions. M-quantile methods have the lowest empirical mean squared error and are substantially better than direct estimators, though there is an open question about how to choose the tuning constant for bias adjustment in practice. The paper makes a further contribution by exploring a doubly robust approach comprising the use of survey weights in conjunction with outlier robust methods in small area estimation.
Text
Robust estimation for small domains in business surveys v8.3 AAM
- Accepted Manuscript
Text
Supplementary material Smith, Bocci, Tzavidis, Krieg & Smeets
More information
Accepted/In Press date: 13 October 2020
e-pub ahead of print date: 11 December 2020
Published date: 8 March 2021
Keywords:
M-quantile regression, bias adjustment, outliers, small area estimation
Identifiers
Local EPrints ID: 444934
URI: http://eprints.soton.ac.uk/id/eprint/444934
ISSN: 0035-9254
PURE UUID: 6d1fb16d-85c2-4141-afd4-e8f28691b471
Catalogue record
Date deposited: 12 Nov 2020 17:31
Last modified: 17 Mar 2024 06:02
Export record
Altmetrics
Contributors
Author:
Chiara Bocci
Author:
Sabine Krieg
Author:
Marc JE Smeets
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics