Evaluating a sequential tree-based procedure for multivariate imputation of complex missing data structures
Evaluating a sequential tree-based procedure for multivariate imputation of complex missing data structures
Item nonresponse in survey data can pose significant problems for social scientists carrying out statistical modeling using a large number of explanatory variables. A number of imputation methods exist but many only deal with univariate imputation, or relatively simple cases of multivariate imputation, often assuming a monotone pattern of missingness. In this paper we evaluate a tree-based approach for multivariate imputation using real data from the 1970 British Cohort Study, known for its complex pattern of nonresponse. The performance of this tree-based approach is compared to mode imputation and a sequential regression based approach within a simulation study.
missing data, sequential imputation, classification tree, 1970 british birth cohort
Borgoni, Riccardo
df9c90ab-c2d2-47d6-bcc7-1444a605d6ff
Berrington, Ann
bd0fc093-310d-4236-8126-ca0c7eb9ddde
Borgoni, Riccardo
df9c90ab-c2d2-47d6-bcc7-1444a605d6ff
Berrington, Ann
bd0fc093-310d-4236-8126-ca0c7eb9ddde
Borgoni, Riccardo and Berrington, Ann
(2011)
Evaluating a sequential tree-based procedure for multivariate imputation of complex missing data structures.
Quality & Quantity.
(doi:10.1007/s11135-011-9638-3).
Abstract
Item nonresponse in survey data can pose significant problems for social scientists carrying out statistical modeling using a large number of explanatory variables. A number of imputation methods exist but many only deal with univariate imputation, or relatively simple cases of multivariate imputation, often assuming a monotone pattern of missingness. In this paper we evaluate a tree-based approach for multivariate imputation using real data from the 1970 British Cohort Study, known for its complex pattern of nonresponse. The performance of this tree-based approach is compared to mode imputation and a sequential regression based approach within a simulation study.
Text
Borgoni_and_Berrington_Sequential_tree_df.pdf
- Version of Record
Restricted to Repository staff only
Request a copy
More information
e-pub ahead of print date: December 2011
Keywords:
missing data, sequential imputation, classification tree, 1970 british birth cohort
Organisations:
Social Statistics & Demography
Identifiers
Local EPrints ID: 201035
URI: http://eprints.soton.ac.uk/id/eprint/201035
ISSN: 0033-5177
PURE UUID: 6eb9e626-833b-4a6d-9e28-fc4d3e7b48fc
Catalogue record
Date deposited: 27 Oct 2011 13:41
Last modified: 15 Mar 2024 02:47
Export record
Altmetrics
Contributors
Author:
Riccardo Borgoni
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics