Use of nonprobability samples for official statistics, state of the art
Use of nonprobability samples for official statistics, state of the art
Tightened budgets, continuing decrease of response rates in traditional probability surveys and increasing pressure by users for more timely data, has stimulated research on the use of nonprobability sample data, such as administrative records, web scraping, mobile phone data and voluntary internet surveys, for inference on finite population parameters like means and totals. These data are often easier, faster and cheaper to collect than traditional probability samples. However, a major concern with the use of this kind of data for official statistics is their nonrepresentativeness due to possible selection bias, which if not accounted for properly, could bias the inference. In this article, we review and discuss methods considered in the literature to deal with this problem and propose new methods, distinguishing between methods based on integration of the nonprobability sample with an appropriate probability sample, and methods that base the inference solely on the nonprobability sample. Empirical illustrations, based on simulated data are provided.
169-196
Pfeffermann, Danny
c7fe07a0-9715-42ce-b90b-1d4f2c2c6ffc
Sverchkov, Michael
e55f2540-b8a5-4142-9645-347834040e09
30 June 2025
Pfeffermann, Danny
c7fe07a0-9715-42ce-b90b-1d4f2c2c6ffc
Sverchkov, Michael
e55f2540-b8a5-4142-9645-347834040e09
Pfeffermann, Danny and Sverchkov, Michael
(2025)
Use of nonprobability samples for official statistics, state of the art.
Survey Methodology, 51 (1), .
Abstract
Tightened budgets, continuing decrease of response rates in traditional probability surveys and increasing pressure by users for more timely data, has stimulated research on the use of nonprobability sample data, such as administrative records, web scraping, mobile phone data and voluntary internet surveys, for inference on finite population parameters like means and totals. These data are often easier, faster and cheaper to collect than traditional probability samples. However, a major concern with the use of this kind of data for official statistics is their nonrepresentativeness due to possible selection bias, which if not accounted for properly, could bias the inference. In this article, we review and discuss methods considered in the literature to deal with this problem and propose new methods, distinguishing between methods based on integration of the nonprobability sample with an appropriate probability sample, and methods that base the inference solely on the nonprobability sample. Empirical illustrations, based on simulated data are provided.
Text
Nonprobability_sampling-_Survey_Methodology_FINAL_(004)_JFB
- Accepted Manuscript
Available under License Other.
Text
Use of nonprobability samples for official statistics, state of the art
- Version of Record
Available under License Other.
More information
e-pub ahead of print date: 30 June 2025
Published date: 30 June 2025
Identifiers
Local EPrints ID: 505304
URI: http://eprints.soton.ac.uk/id/eprint/505304
ISSN: 0714-0045
PURE UUID: 9b6dcc97-bc04-454f-a628-bbbf19e8201d
Catalogue record
Date deposited: 06 Oct 2025 16:49
Last modified: 07 Oct 2025 01:36
Export record
Contributors
Author:
Michael Sverchkov
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics