The University of Southampton
University of Southampton Institutional Repository

Nonignorable nonresponse adjustment using fully nonparametric approach

Nonignorable nonresponse adjustment using fully nonparametric approach
Nonignorable nonresponse adjustment using fully nonparametric approach
Nonresponse is an increasingly common problem in surveys. It is a problem because it causes missing data and, more importantly, because such missing data are a potential source of bias for estimates. Most of the methods dealing with nonresponse assume either explicitly or implicitly that the missing values are missing at random (MAR). We consider the situations where the probability to respond may depend on the outcome value even after conditioning on the covariates. For this kind of response mechanism, the missing outcomes are not missing at random (NMAR). The problem of missing data is handled either using fully parametric or semi-parametric approaches. These approaches have some potential issues, for example, strict distributional assumptions, heavy computations, etc.

We propose a fully non-parametric approach; first we postulate informative individual response probabilities i.e. the response probability may depend on the values of interest, and it may be specific to each individual. We treat the outcome variable as a fixed constant just like in the design based approach to survey sampling. Then we use an estimating equations approach to define the finite population parameters. Hence the approach is fully non-parametric provided the individual specific response probabilities can be estimated non-parametrically. For longitudinal data it is possible that one can have individual historic response rate and those can be used as an empirical estimator for the individual specific response probability. We utilize this individual historic response rate as an estimator for the unknown response probability. If the unknown response probability is consistently estimated then the proof for consistency of estimators is much easier and much more common. But in our case the historic response rate is unbiased but not consistent because practically we cannot have infinitely many historic time points but we can have many units. We try to prove the asymptotic unbiasedness of estimating equations and further the consistency of estimates but we could not prove it and the reason is discussed in Section 2.4. It provides an interesting investigation of pursing consistency. We develop the associated variance estimator. Being a fully non-parametric and computationally simple method, it can be used as a widely applicable exploratory data analysis technique for NMAR mechanisms, as long as there exit a response history, in advance of more sophisticated and possibly more efficient modelling methods.

The approach is extended for a longitudinal setting and two types of EEs are defined to estimate
parameters that are defined over time, such as the change between two successive time points or the
regression coefficients involving outcomes over time. The associated variances estimators using
both EEs are also developed.

The non-parametric estimating equations (NEE) approach for cross-sectional and longitu- dinal
setting is not unbiased. We therefore develop bias-adjusting NEE approach to adjust the bias in
cross-sectional and longitudinal parameter estimates. Another advantage of the bias- adjusting EE
approach is that the variance estimator based on bias-adjusting NEE is expected to be less biased
as compared to the unadjusted approach. Moreover, Taylor expansion is used to adjust the bias in
variance estimate obtained from simple and bias-adjusted NEE approaches. A comprehensive simulation
study is conducted using real and simulated data to assess the performance of NEE and
bias-adjusted NEE approaches under various settings for cross-
sectional as well as for longitudinal data.
University of Southampton
Ahmad, Zahoor
1a4418f6-8855-4244-a89f-882eadff9721
Ahmad, Zahoor
1a4418f6-8855-4244-a89f-882eadff9721
Zhang, Li-Chun
a5d48518-7f71-4ed9-bdcb-6585c2da3649

Ahmad, Zahoor (2019) Nonignorable nonresponse adjustment using fully nonparametric approach. University of Southampton, Doctoral Thesis, 165pp.

Record type: Thesis (Doctoral)

Abstract

Nonresponse is an increasingly common problem in surveys. It is a problem because it causes missing data and, more importantly, because such missing data are a potential source of bias for estimates. Most of the methods dealing with nonresponse assume either explicitly or implicitly that the missing values are missing at random (MAR). We consider the situations where the probability to respond may depend on the outcome value even after conditioning on the covariates. For this kind of response mechanism, the missing outcomes are not missing at random (NMAR). The problem of missing data is handled either using fully parametric or semi-parametric approaches. These approaches have some potential issues, for example, strict distributional assumptions, heavy computations, etc.

We propose a fully non-parametric approach; first we postulate informative individual response probabilities i.e. the response probability may depend on the values of interest, and it may be specific to each individual. We treat the outcome variable as a fixed constant just like in the design based approach to survey sampling. Then we use an estimating equations approach to define the finite population parameters. Hence the approach is fully non-parametric provided the individual specific response probabilities can be estimated non-parametrically. For longitudinal data it is possible that one can have individual historic response rate and those can be used as an empirical estimator for the individual specific response probability. We utilize this individual historic response rate as an estimator for the unknown response probability. If the unknown response probability is consistently estimated then the proof for consistency of estimators is much easier and much more common. But in our case the historic response rate is unbiased but not consistent because practically we cannot have infinitely many historic time points but we can have many units. We try to prove the asymptotic unbiasedness of estimating equations and further the consistency of estimates but we could not prove it and the reason is discussed in Section 2.4. It provides an interesting investigation of pursing consistency. We develop the associated variance estimator. Being a fully non-parametric and computationally simple method, it can be used as a widely applicable exploratory data analysis technique for NMAR mechanisms, as long as there exit a response history, in advance of more sophisticated and possibly more efficient modelling methods.

The approach is extended for a longitudinal setting and two types of EEs are defined to estimate
parameters that are defined over time, such as the change between two successive time points or the
regression coefficients involving outcomes over time. The associated variances estimators using
both EEs are also developed.

The non-parametric estimating equations (NEE) approach for cross-sectional and longitu- dinal
setting is not unbiased. We therefore develop bias-adjusting NEE approach to adjust the bias in
cross-sectional and longitudinal parameter estimates. Another advantage of the bias- adjusting EE
approach is that the variance estimator based on bias-adjusting NEE is expected to be less biased
as compared to the unadjusted approach. Moreover, Taylor expansion is used to adjust the bias in
variance estimate obtained from simple and bias-adjusted NEE approaches. A comprehensive simulation
study is conducted using real and simulated data to assess the performance of NEE and
bias-adjusted NEE approaches under various settings for cross-
sectional as well as for longitudinal data.

Text
Zahoor Ahmad Final Thesis Submission - Version of Record
Available under License University of Southampton Thesis Licence.
Download (1MB)
Text
ZAhmad Permission to deposit thesis - form-filled-signed_RW
Restricted to Repository staff only

More information

Published date: February 2019

Identifiers

Local EPrints ID: 444047
URI: http://eprints.soton.ac.uk/id/eprint/444047
PURE UUID: a02629ea-87f5-465d-8b4a-645a4a62dd3c
ORCID for Li-Chun Zhang: ORCID iD orcid.org/0000-0002-3944-9484

Catalogue record

Date deposited: 23 Sep 2020 16:49
Last modified: 17 Mar 2024 03:30

Export record

Contributors

Author: Zahoor Ahmad
Thesis advisor: Li-Chun Zhang ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×