Estimating propensity scores with missing covariate data using general location mixture models
Estimating propensity scores with missing covariate data using general location mixture models
In many observational studies, researchers estimate causal effects using propensity scores, e.g., by matching or sub-classifying on the scores. Estimation of propensity scores is complicated when some values of the covariates are
missing. We propose to use multiple imputation to create completed datasets, from which propensity scores can be estimated, with a general location mixture model. The model assumes that the control units are a latent mixture of (i)
units whose covariates are drawn from the same distributions as the treated units’ covariates and (ii) units whose covariates are drawn from different distributions. This formulation reduces the influence of control units outside the treated units’ region of the covariate space on the estimation of parameters in the imputation model, which can result in more plausible imputations and better balance in the true covariate distributions. We illustrate the benefits of 1 the latent class modeling approach with simulations and with an observational
study of the effect of breast feeding on children’s cognitive abilities.
latent class, missing data, multiple imputation, observational studies, propensity score
Southampton Statistical Sciences Research Institute, University of Southampton
Mitra, Robin
2b944cd7-5be8-4dd1-ab44-f8ada9a33405
Reiter, Jerome P.
b70b3bb7-ac77-4dd1-860a-4daa92e02fc2
4 August 2009
Mitra, Robin
2b944cd7-5be8-4dd1-ab44-f8ada9a33405
Reiter, Jerome P.
b70b3bb7-ac77-4dd1-860a-4daa92e02fc2
Mitra, Robin and Reiter, Jerome P.
(2009)
Estimating propensity scores with missing covariate data using general location mixture models
(S3RI Methodology working papers, M09/13)
Southampton, UK.
Southampton Statistical Sciences Research Institute, University of Southampton
47pp.
Record type:
Monograph
(Working Paper)
Abstract
In many observational studies, researchers estimate causal effects using propensity scores, e.g., by matching or sub-classifying on the scores. Estimation of propensity scores is complicated when some values of the covariates are
missing. We propose to use multiple imputation to create completed datasets, from which propensity scores can be estimated, with a general location mixture model. The model assumes that the control units are a latent mixture of (i)
units whose covariates are drawn from the same distributions as the treated units’ covariates and (ii) units whose covariates are drawn from different distributions. This formulation reduces the influence of control units outside the treated units’ region of the covariate space on the estimation of parameters in the imputation model, which can result in more plausible imputations and better balance in the true covariate distributions. We illustrate the benefits of 1 the latent class modeling approach with simulations and with an observational
study of the effect of breast feeding on children’s cognitive abilities.
Text
s3ri-workingpaper-M09-13.pdf
- Other
More information
Published date: 4 August 2009
Keywords:
latent class, missing data, multiple imputation, observational studies, propensity score
Identifiers
Local EPrints ID: 67154
URI: http://eprints.soton.ac.uk/id/eprint/67154
PURE UUID: 9669b728-280e-4205-94bb-9f3b7f5f15c2
Catalogue record
Date deposited: 06 Aug 2009
Last modified: 13 Mar 2024 18:45
Export record
Contributors
Author:
Jerome P. Reiter
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics