Estimating propensity scores with missing covariate data using general location mixture models
Estimating propensity scores with missing covariate data using general location mixture models
In many observational studies, analysts estimate causal effects using propensity scores, e.g. by matching, sub-classifying, or inverse probability weighting based on the scores. Estimation of propensity scores is complicated when some values of the covariates are missing. Analysts can use multiple imputation to create completed data sets from which propensity scores can be estimated. We propose a general location mixture model for imputations that assumes that the control units are a latent mixture of (i) units whose covariates are drawn from the same distributions as the treated units' covariates and (ii) units whose covariates are drawn from different distributions. This formulation reduces the influence of control units outside the treated units' region of the covariate space on the estimation of parameters in the imputation model, which can result in more plausible imputations. In turn, this can result in more reliable estimates of propensity scores and better balance in the true covariate distributions when matching or sub-classifying. We illustrate the benefits of the latent class modeling approach with simulations and with an observational study of the effect of breast feeding on children's cognitive abilities
627-641
Mitra, Robin
2b944cd7-5be8-4dd1-ab44-f8ada9a33405
Reiter, Jerome P.
b70b3bb7-ac77-4dd1-860a-4daa92e02fc2
March 2011
Mitra, Robin
2b944cd7-5be8-4dd1-ab44-f8ada9a33405
Reiter, Jerome P.
b70b3bb7-ac77-4dd1-860a-4daa92e02fc2
Mitra, Robin and Reiter, Jerome P.
(2011)
Estimating propensity scores with missing covariate data using general location mixture models.
Statistics in Medicine, 30 (6), .
(doi:10.1002/sim.4124).
Abstract
In many observational studies, analysts estimate causal effects using propensity scores, e.g. by matching, sub-classifying, or inverse probability weighting based on the scores. Estimation of propensity scores is complicated when some values of the covariates are missing. Analysts can use multiple imputation to create completed data sets from which propensity scores can be estimated. We propose a general location mixture model for imputations that assumes that the control units are a latent mixture of (i) units whose covariates are drawn from the same distributions as the treated units' covariates and (ii) units whose covariates are drawn from different distributions. This formulation reduces the influence of control units outside the treated units' region of the covariate space on the estimation of parameters in the imputation model, which can result in more plausible imputations. In turn, this can result in more reliable estimates of propensity scores and better balance in the true covariate distributions when matching or sub-classifying. We illustrate the benefits of the latent class modeling approach with simulations and with an observational study of the effect of breast feeding on children's cognitive abilities
More information
Published date: March 2011
Organisations:
Statistics
Identifiers
Local EPrints ID: 181581
URI: http://eprints.soton.ac.uk/id/eprint/181581
ISSN: 0277-6715
PURE UUID: dd6c14aa-4b53-4bfa-80b0-4b1f29b56db9
Catalogue record
Date deposited: 19 Apr 2011 13:36
Last modified: 14 Mar 2024 02:56
Export record
Altmetrics
Contributors
Author:
Jerome P. Reiter
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics