Correcting for measurement error when estimating pay distributions from household survey data
Correcting for measurement error when estimating pay distributions from household survey data
The aim of this thesis is to develop and to evaluate different methods for estimating distributions in the presence of measurement error and missing data with a primary focus on a specific application concerning pay. Different methods for correcting for measurement error in a fully observed variable are considered by taking into account information on the accurately measured variable observed on a non-random subsample. To compensate for nonresponse in the correct variable and to effectively correct for measurement error in the erroneously observed variable several imputation methods are proposed treating the problem of measurement error as a missing data problem. Based on the assumption that the data are missing at random (MAR) hot deck imputation within classes as a form of predictive mean matching imputation is evaluated theoretically and empirically. This method provides approximately unbiased estimates of the parameter of interest, the proportion below a given threshold. The problem of estimating the variance of the estimator under this imputation method is investigated. A variance estimator is proposed which allows for uncertainty due to imputation. It is shown that this estimator is approximately unbiased under certain conditions.
Since some evidence is found that the results under hot deck imputation within classes may depend on the choice of imputation classes other forms of predictive mean matching imputation are evaluated theoretically and empirically under the assumption of MAR. The imputation methods are also compared to propensity score weighing. The use of repeated imputation shows gains in efficacy in comparison to single value imputation. It is found that nearest neighbour imputation using repeated imputation shows advantages in terms of bias robustness and efficiency of the point estimator. It is therefore recommended for practical use.
Several estimation methods under nonignorable nonresponse are considered making an alternative assumption of common measurement error (CME). An imputation method using data augmentation based on the assumption of CME rather than MAR is derived, which shows desirable properties of the point estimator of interest. The use of hot deck imputation in the data augmentation procedure is proposed. Data augmentation using nearest neighbour imputation under the assumption of CME is found to have desirable properties for the pay application.
University of Southampton
Beissel-Durrant, Gabriele
9630d22e-5f26-4407-bcfd-9674a03b4ee1
2003
Beissel-Durrant, Gabriele
9630d22e-5f26-4407-bcfd-9674a03b4ee1
Beissel-Durrant, Gabriele
(2003)
Correcting for measurement error when estimating pay distributions from household survey data.
University of Southampton, Doctoral Thesis.
Record type:
Thesis
(Doctoral)
Abstract
The aim of this thesis is to develop and to evaluate different methods for estimating distributions in the presence of measurement error and missing data with a primary focus on a specific application concerning pay. Different methods for correcting for measurement error in a fully observed variable are considered by taking into account information on the accurately measured variable observed on a non-random subsample. To compensate for nonresponse in the correct variable and to effectively correct for measurement error in the erroneously observed variable several imputation methods are proposed treating the problem of measurement error as a missing data problem. Based on the assumption that the data are missing at random (MAR) hot deck imputation within classes as a form of predictive mean matching imputation is evaluated theoretically and empirically. This method provides approximately unbiased estimates of the parameter of interest, the proportion below a given threshold. The problem of estimating the variance of the estimator under this imputation method is investigated. A variance estimator is proposed which allows for uncertainty due to imputation. It is shown that this estimator is approximately unbiased under certain conditions.
Since some evidence is found that the results under hot deck imputation within classes may depend on the choice of imputation classes other forms of predictive mean matching imputation are evaluated theoretically and empirically under the assumption of MAR. The imputation methods are also compared to propensity score weighing. The use of repeated imputation shows gains in efficacy in comparison to single value imputation. It is found that nearest neighbour imputation using repeated imputation shows advantages in terms of bias robustness and efficiency of the point estimator. It is therefore recommended for practical use.
Several estimation methods under nonignorable nonresponse are considered making an alternative assumption of common measurement error (CME). An imputation method using data augmentation based on the assumption of CME rather than MAR is derived, which shows desirable properties of the point estimator of interest. The use of hot deck imputation in the data augmentation procedure is proposed. Data augmentation using nearest neighbour imputation under the assumption of CME is found to have desirable properties for the pay application.
Text
947986.pdf
- Version of Record
More information
Published date: 2003
Identifiers
Local EPrints ID: 465393
URI: http://eprints.soton.ac.uk/id/eprint/465393
PURE UUID: ca3a0b63-0eda-4b83-912e-3dc15d77935b
Catalogue record
Date deposited: 05 Jul 2022 00:42
Last modified: 16 Mar 2024 20:09
Export record
Contributors
Author:
Gabriele Beissel-Durrant
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
Loading...
View more statistics