University of Southampton Institutional Repository

Evaluating a sequential tree-based procedure for multivariate imputation of complex missing data structures


Borgoni, Riccardo and Berrington, Ann (2011) Evaluating a sequential tree-based procedure for multivariate imputation of complex missing data structures. Quality & Quantity. (doi:10.1007/s11135-011-9638-3).

Record type: Article

Abstract

Item nonresponse in survey data can pose significant problems for social scientists carrying out statistical modeling with a large number of explanatory variables. A number of imputation methods exist, but many deal only with univariate imputation or with relatively simple cases of multivariate imputation, often assuming a monotone pattern of missingness. In this paper we evaluate a tree-based approach to multivariate imputation using real data from the 1970 British Cohort Study, which is known for its complex pattern of nonresponse. In a simulation study, the performance of this tree-based approach is compared with that of mode imputation and of a sequential regression-based approach.

Text
Borgoni_and_Berrington_Sequential_tree_df.pdf - Version of Record
Restricted to Repository staff only

More information

e-pub ahead of print date: December 2011
Keywords: missing data, sequential imputation, classification tree, 1970 British birth cohort
Organisations: Social Statistics & Demography

Identifiers

Local EPrints ID: 201035
URI: http://eprints.soton.ac.uk/id/eprint/201035
ISSN: 0033-5177
PURE UUID: 6eb9e626-833b-4a6d-9e28-fc4d3e7b48fc
ORCID for Ann Berrington: orcid.org/0000-0002-1683-6668

Catalogue record

Date deposited: 27 Oct 2011 13:41
Last modified: 15 Mar 2024 02:47

Contributors

Author: Riccardo Borgoni
Author: Ann Berrington
