Nonignorable nonresponse adjustment using fully nonparametric approach
Nonignorable nonresponse adjustment using fully nonparametric approach
Nonresponse is an increasingly common problem in surveys. It is a problem because it causes missing data and, more importantly, because such missing data are a potential source of bias for estimates. Most of the methods dealing with nonresponse assume either explicitly or implicitly that the missing values are missing at random (MAR). We consider the situations where the probability to respond may depend on the outcome value even after conditioning on the covariates. For this kind of response mechanism, the missing outcomes are not missing at random (NMAR). The problem of missing data is handled either using fully parametric or semi-parametric approaches. These approaches have some potential issues, for example, strict distributional assumptions, heavy computations, etc.
We propose a fully non-parametric approach; first we postulate informative individual response probabilities i.e. the response probability may depend on the values of interest, and it may be specific to each individual. We treat the outcome variable as a fixed constant just like in the design based approach to survey sampling. Then we use an estimating equations approach to define the finite population parameters. Hence the approach is fully non-parametric provided the individual specific response probabilities can be estimated non-parametrically. For longitudinal data it is possible that one can have individual historic response rate and those can be used as an empirical estimator for the individual specific response probability. We utilize this individual historic response rate as an estimator for the unknown response probability. If the unknown response probability is consistently estimated then the proof for consistency of estimators is much easier and much more common. But in our case the historic response rate is unbiased but not consistent because practically we cannot have infinitely many historic time points but we can have many units. We try to prove the asymptotic unbiasedness of estimating equations and further the consistency of estimates but we could not prove it and the reason is discussed in Section 2.4. It provides an interesting investigation of pursing consistency. We develop the associated variance estimator. Being a fully non-parametric and computationally simple method, it can be used as a widely applicable exploratory data analysis technique for NMAR mechanisms, as long as there exit a response history, in advance of more sophisticated and possibly more efficient modelling methods.
The approach is extended for a longitudinal setting and two types of EEs are defined to estimate
parameters that are defined over time, such as the change between two successive time points or the
regression coefficients involving outcomes over time. The associated variances estimators using
both EEs are also developed.
The non-parametric estimating equations (NEE) approach for cross-sectional and longitu- dinal
setting is not unbiased. We therefore develop bias-adjusting NEE approach to adjust the bias in
cross-sectional and longitudinal parameter estimates. Another advantage of the bias- adjusting EE
approach is that the variance estimator based on bias-adjusting NEE is expected to be less biased
as compared to the unadjusted approach. Moreover, Taylor expansion is used to adjust the bias in
variance estimate obtained from simple and bias-adjusted NEE approaches. A comprehensive simulation
study is conducted using real and simulated data to assess the performance of NEE and
bias-adjusted NEE approaches under various settings for cross-
sectional as well as for longitudinal data.
University of Southampton
Ahmad, Zahoor
1a4418f6-8855-4244-a89f-882eadff9721
February 2019
Ahmad, Zahoor
1a4418f6-8855-4244-a89f-882eadff9721
Zhang, Li-Chun
a5d48518-7f71-4ed9-bdcb-6585c2da3649
Ahmad, Zahoor
(2019)
Nonignorable nonresponse adjustment using fully nonparametric approach.
University of Southampton, Doctoral Thesis, 165pp.
Record type:
Thesis
(Doctoral)
Abstract
Nonresponse is an increasingly common problem in surveys. It is a problem because it causes missing data and, more importantly, because such missing data are a potential source of bias for estimates. Most of the methods dealing with nonresponse assume either explicitly or implicitly that the missing values are missing at random (MAR). We consider the situations where the probability to respond may depend on the outcome value even after conditioning on the covariates. For this kind of response mechanism, the missing outcomes are not missing at random (NMAR). The problem of missing data is handled either using fully parametric or semi-parametric approaches. These approaches have some potential issues, for example, strict distributional assumptions, heavy computations, etc.
We propose a fully non-parametric approach; first we postulate informative individual response probabilities i.e. the response probability may depend on the values of interest, and it may be specific to each individual. We treat the outcome variable as a fixed constant just like in the design based approach to survey sampling. Then we use an estimating equations approach to define the finite population parameters. Hence the approach is fully non-parametric provided the individual specific response probabilities can be estimated non-parametrically. For longitudinal data it is possible that one can have individual historic response rate and those can be used as an empirical estimator for the individual specific response probability. We utilize this individual historic response rate as an estimator for the unknown response probability. If the unknown response probability is consistently estimated then the proof for consistency of estimators is much easier and much more common. But in our case the historic response rate is unbiased but not consistent because practically we cannot have infinitely many historic time points but we can have many units. We try to prove the asymptotic unbiasedness of estimating equations and further the consistency of estimates but we could not prove it and the reason is discussed in Section 2.4. It provides an interesting investigation of pursing consistency. We develop the associated variance estimator. Being a fully non-parametric and computationally simple method, it can be used as a widely applicable exploratory data analysis technique for NMAR mechanisms, as long as there exit a response history, in advance of more sophisticated and possibly more efficient modelling methods.
The approach is extended for a longitudinal setting and two types of EEs are defined to estimate
parameters that are defined over time, such as the change between two successive time points or the
regression coefficients involving outcomes over time. The associated variances estimators using
both EEs are also developed.
The non-parametric estimating equations (NEE) approach for cross-sectional and longitu- dinal
setting is not unbiased. We therefore develop bias-adjusting NEE approach to adjust the bias in
cross-sectional and longitudinal parameter estimates. Another advantage of the bias- adjusting EE
approach is that the variance estimator based on bias-adjusting NEE is expected to be less biased
as compared to the unadjusted approach. Moreover, Taylor expansion is used to adjust the bias in
variance estimate obtained from simple and bias-adjusted NEE approaches. A comprehensive simulation
study is conducted using real and simulated data to assess the performance of NEE and
bias-adjusted NEE approaches under various settings for cross-
sectional as well as for longitudinal data.
Text
Zahoor Ahmad Final Thesis Submission
- Version of Record
Text
ZAhmad Permission to deposit thesis - form-filled-signed_RW
Restricted to Repository staff only
More information
Published date: February 2019
Identifiers
Local EPrints ID: 444047
URI: http://eprints.soton.ac.uk/id/eprint/444047
PURE UUID: a02629ea-87f5-465d-8b4a-645a4a62dd3c
Catalogue record
Date deposited: 23 Sep 2020 16:49
Last modified: 17 Mar 2024 03:30
Export record
Contributors
Author:
Zahoor Ahmad
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics