The University of Southampton
University of Southampton Institutional Repository

Statistical inference under nonignorable sampling and nonresponse - an empirical likelihood approach

Statistical inference under nonignorable sampling and nonresponse - an empirical likelihood approach
Statistical inference under nonignorable sampling and nonresponse - an empirical likelihood approach
Statistical models are often based on sample surveys. When the sample selection probabilities and/or the response probabilities are related to a model outcome variable, even after conditioning on the model covariates, the model holding for the observed data is different from the model holding in the population, resulting in biased inference if not accounted for properly. Accounting for sample selection bias is relatively simple because the sample selection probabilities are usually known. Accounting for nonignorable nonresponse is much harder since the response probabilities are, in practice, unknown. In this article, we develop a new approach for modelling complex survey data, which accounts simultaneously for nonignorable sampling and nonresponse. Our proposed approach combines the nonparametric empirical likelihood with a parametric model for the response probabilities, which contains the outcome variable as one of the covariates. Combining the model holding for the responding units with the model for the response probabilities enables extracting the model holding for the missing data and imputing them. We propose ways of testing the underlying model holding for the respondents’ data. Simulation results illustrate the good performance of the approach in terms of parameter estimation and imputation. We conclude with an application to the household expenditure survey in Israel, carried out by Israel’s Central Bureau of Statistics. The survey collects information on the socio-demographic characteristics of each member of the sampled households (HH), as well as detailed information on the HH income and expenditure. The total sample size was n = 12,136 with 7,827 responding HHs. The target estimated parameter in this application is the population mean of the gross HH income.
2325-0984
Pfeffermann, Danny
c7fe07a0-9715-42ce-b90b-1d4f2c2c6ffc
Preminger, Arie
e00ab4c0-43b0-4df4-995c-4ec6bd07e36a
Sikov, Anna
81a74f0d-d006-49df-80f5-ed626b989828
Pfeffermann, Danny
c7fe07a0-9715-42ce-b90b-1d4f2c2c6ffc
Preminger, Arie
e00ab4c0-43b0-4df4-995c-4ec6bd07e36a
Sikov, Anna
81a74f0d-d006-49df-80f5-ed626b989828

Pfeffermann, Danny, Preminger, Arie and Sikov, Anna (2025) Statistical inference under nonignorable sampling and nonresponse - an empirical likelihood approach. Journal of Survey Statistics and Methodology, [smaf015]. (doi:10.1093/jssam/smaf015).

Record type: Article

Abstract

Statistical models are often based on sample surveys. When the sample selection probabilities and/or the response probabilities are related to a model outcome variable, even after conditioning on the model covariates, the model holding for the observed data is different from the model holding in the population, resulting in biased inference if not accounted for properly. Accounting for sample selection bias is relatively simple because the sample selection probabilities are usually known. Accounting for nonignorable nonresponse is much harder since the response probabilities are, in practice, unknown. In this article, we develop a new approach for modelling complex survey data, which accounts simultaneously for nonignorable sampling and nonresponse. Our proposed approach combines the nonparametric empirical likelihood with a parametric model for the response probabilities, which contains the outcome variable as one of the covariates. Combining the model holding for the responding units with the model for the response probabilities enables extracting the model holding for the missing data and imputing them. We propose ways of testing the underlying model holding for the respondents’ data. Simulation results illustrate the good performance of the approach in terms of parameter estimation and imputation. We conclude with an application to the household expenditure survey in Israel, carried out by Israel’s Central Bureau of Statistics. The survey collects information on the socio-demographic characteristics of each member of the sampled households (HH), as well as detailed information on the HH income and expenditure. The total sample size was n = 12,136 with 7,827 responding HHs. The target estimated parameter in this application is the population mean of the gross HH income.

Text
smaf015_RevisedProof (6)- 04-10-2025 - Version of Record
Available under License Creative Commons Attribution.
Download (591kB)

More information

Accepted/In Press date: 4 October 2025
e-pub ahead of print date: 23 October 2025

Identifiers

Local EPrints ID: 506672
URI: http://eprints.soton.ac.uk/id/eprint/506672
ISSN: 2325-0984
PURE UUID: 39071b65-b9f0-4ce8-bd56-bbe5b844268d
ORCID for Danny Pfeffermann: ORCID iD orcid.org/0000-0001-7573-2829

Catalogue record

Date deposited: 13 Nov 2025 17:48
Last modified: 14 Nov 2025 02:34

Export record

Altmetrics

Contributors

Author: Arie Preminger
Author: Anna Sikov

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×