The University of Southampton
University of Southampton Institutional Repository

Simulation-based estimation methods for regression models with covariate measurement error

Simulation-based estimation methods for regression models with covariate measurement error
Simulation-based estimation methods for regression models with covariate measurement error

Estimation methods for generalized linear models with some covariates measured with error are considered. It is assumed that a response variable Y depends on a set of covariates, of which U are measured accurately but X are subject to measurement error so that surrogate variables Z are observed instead of X. The vector X may be mixed, that is it may contain both continuous and categorical variables. Two data sets are assumed to be available, a primary data set of observations (Yi, Ui, Zi) and a validation data set of observations (Xi, Zi, Ui). Two types of validation data are considered, internal data sets where the distribution of X given U is the same as in the primary data, and external validation data sets where the distribution is not the same in the two data sets.

Measurement error causes bias in the estimated coefficients of the model of interest. Information on the error obtained from the validation data are utilized to take the effects of measurement error into account and compute adjusted estimates which are less biased. The estimation methods considered here are structural modelling methods where all the distributions of the observed variables are specified parametrically. Distributions for X given U and for Z given X and U are defined for mixed covariates using models based on homogenous conditional Gaussian distributions. A rejection sampling method for generating variates from the distribution of X given U, Z and Y under these models is developed.

Four simulation-based estimation methods are described: (i) data augmentation, which is a Bayesian approach and a version of the Gibbs sampling algorithm; (ii) Monte Carlo EM, a modification of the standard EM algorithm with simulation-based Monte Carlo integration used at the E-step; (iii) a method based on using Monte Carlo integration to evaluate the density function of Y given the observed data; (iv) Monte Carlo regression calibration, a first-order approximation of method (iii). The estimators are based on general computational techniques developed for missing data problems, and they are thus applicable also to other measurement error models than the ones considered here.

University of Southampton
Kuha, Jouni Tapio
Kuha, Jouni Tapio

Kuha, Jouni Tapio (1995) Simulation-based estimation methods for regression models with covariate measurement error. University of Southampton, Doctoral Thesis.

Record type: Thesis (Doctoral)

Abstract

Estimation methods for generalized linear models with some covariates measured with error are considered. It is assumed that a response variable Y depends on a set of covariates, of which U are measured accurately but X are subject to measurement error so that surrogate variables Z are observed instead of X. The vector X may be mixed, that is it may contain both continuous and categorical variables. Two data sets are assumed to be available, a primary data set of observations (Yi, Ui, Zi) and a validation data set of observations (Xi, Zi, Ui). Two types of validation data are considered, internal data sets where the distribution of X given U is the same as in the primary data, and external validation data sets where the distribution is not the same in the two data sets.

Measurement error causes bias in the estimated coefficients of the model of interest. Information on the error obtained from the validation data are utilized to take the effects of measurement error into account and compute adjusted estimates which are less biased. The estimation methods considered here are structural modelling methods where all the distributions of the observed variables are specified parametrically. Distributions for X given U and for Z given X and U are defined for mixed covariates using models based on homogenous conditional Gaussian distributions. A rejection sampling method for generating variates from the distribution of X given U, Z and Y under these models is developed.

Four simulation-based estimation methods are described: (i) data augmentation, which is a Bayesian approach and a version of the Gibbs sampling algorithm; (ii) Monte Carlo EM, a modification of the standard EM algorithm with simulation-based Monte Carlo integration used at the E-step; (iii) a method based on using Monte Carlo integration to evaluate the density function of Y given the observed data; (iv) Monte Carlo regression calibration, a first-order approximation of method (iii). The estimators are based on general computational techniques developed for missing data problems, and they are thus applicable also to other measurement error models than the ones considered here.

This record has no associated files available for download.

More information

Published date: 1995

Identifiers

Local EPrints ID: 459928
URI: http://eprints.soton.ac.uk/id/eprint/459928
PURE UUID: cc67d6ca-282c-46dd-9a9f-ff00e0d03551

Catalogue record

Date deposited: 04 Jul 2022 17:28
Last modified: 04 Jul 2022 17:28

Export record

Contributors

Author: Jouni Tapio Kuha

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×