The University of Southampton
University of Southampton Institutional Repository

Accounting for non‐ignorable sampling and non‐response in statistical matching

Accounting for non‐ignorable sampling and non‐response in statistical matching
Accounting for non‐ignorable sampling and non‐response in statistical matching

Data for statistical analysis is often available from different samples, with each sample containing measurements on only some of the variables of interest. Statistical matching attempts to generate a fused database containing matched measurements on all the target variables. In this article, we consider the use of statistical matching when the samples are drawn by informative sampling designs and are subject to not missing at random non-response. The problem with ignoring the sampling process and non-response is that the distribution of the data observed for the responding units can be very different from the distribution holding for the population data, which may distort the inference process and result in a matched database that misrepresents the joint distribution in the population. Our proposed methodology employs the empirical likelihood approach and is shown to perform well in a simulation experiment and when applied to real sample data.

IPF algorithm, NMAR non-response, empirical likelihood, fusion, matching uncertainty, sample and respondents distributions
0306-7734
Marella, Daniela
389be9c8-ff35-4ce5-91f6-3e29ccc53baa
Pfeffermann, Danny
c7fe07a0-9715-42ce-b90b-1d4f2c2c6ffc
Marella, Daniela
389be9c8-ff35-4ce5-91f6-3e29ccc53baa
Pfeffermann, Danny
c7fe07a0-9715-42ce-b90b-1d4f2c2c6ffc

Marella, Daniela and Pfeffermann, Danny (2022) Accounting for non‐ignorable sampling and non‐response in statistical matching. International Statistical Review. (doi:10.1111/insr.12524).

Record type: Article

Abstract

Data for statistical analysis is often available from different samples, with each sample containing measurements on only some of the variables of interest. Statistical matching attempts to generate a fused database containing matched measurements on all the target variables. In this article, we consider the use of statistical matching when the samples are drawn by informative sampling designs and are subject to not missing at random non-response. The problem with ignoring the sampling process and non-response is that the distribution of the data observed for the responding units can be very different from the distribution holding for the population data, which may distort the inference process and result in a matched database that misrepresents the joint distribution in the population. Our proposed methodology employs the empirical likelihood approach and is shown to perform well in a simulation experiment and when applied to real sample data.

Text
Statistical matching-Informative sampling and nonresponse - Accepted Manuscript
Available under License Creative Commons Attribution.
Download (1MB)
Text
Int Statistical Rev - 2022 - Marella - Accounting for Non‐ignorable Sampling and Non‐response in Statistical Matching - Version of Record
Available under License Creative Commons Attribution.
Download (605kB)

More information

Accepted/In Press date: 26 September 2022
e-pub ahead of print date: 19 October 2022
Keywords: IPF algorithm, NMAR non-response, empirical likelihood, fusion, matching uncertainty, sample and respondents distributions

Identifiers

Local EPrints ID: 472844
URI: http://eprints.soton.ac.uk/id/eprint/472844
ISSN: 0306-7734
PURE UUID: 7b13f52d-9070-41dd-a4dd-0b6783305ceb

Catalogue record

Date deposited: 20 Dec 2022 17:33
Last modified: 30 Oct 2023 04:10

Export record

Altmetrics

Contributors

Author: Daniela Marella

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×