A comparison of approaches for implementation of k-nearest neighbour imputation for missing items in cross-national, time series data sets of economic indicators
A comparison of approaches for implementation of k-nearest neighbour imputation for missing items in cross-national, time series data sets of economic indicators
The need for predictive accuracy in the imputation for missing data in cross‐national, time series data is discussed and the possibility of requiring unconventional approaches to imputation, namely approaches which are tailored to the specific context and applied to individual instances of missing items is also discussed. Potential barriers to moving toward such an approach are mentioned and in particular, the demands on resources implied by that. A taxonomy of available observations is established with the aim of being able to use it to quickly and efficiently identify potential solutions for imputing missing data. A simulation study is conducted in which the relative performance of different k‐nearest neighbor imputation implementations are related to the context in which they are set to operate with a view to providing practitioners with a‐priori understanding of which techniques are likely to perform better under any given particular set of circumstances. A multinomial model is used to begin to investigate the interaction between imputation implementations, and the role that context might play in the accuracy of their imputations.
University of Southampton
Mason, Ben Ross
5f0dcaf5-4158-4891-9838-2c8920e2fc33
2018
Mason, Ben Ross
5f0dcaf5-4158-4891-9838-2c8920e2fc33
Tzavidis, Nikolaos
431ec55d-c147-466d-9c65-0f377b0c1f6a
Pfeffermann, Danny
c7fe07a0-9715-42ce-b90b-1d4f2c2c6ffc
Mason, Ben Ross
(2018)
A comparison of approaches for implementation of k-nearest neighbour imputation for missing items in cross-national, time series data sets of economic indicators.
University of Southampton, Doctoral Thesis, 161pp.
Record type:
Thesis
(Doctoral)
Abstract
The need for predictive accuracy in the imputation for missing data in cross‐national, time series data is discussed and the possibility of requiring unconventional approaches to imputation, namely approaches which are tailored to the specific context and applied to individual instances of missing items is also discussed. Potential barriers to moving toward such an approach are mentioned and in particular, the demands on resources implied by that. A taxonomy of available observations is established with the aim of being able to use it to quickly and efficiently identify potential solutions for imputing missing data. A simulation study is conducted in which the relative performance of different k‐nearest neighbor imputation implementations are related to the context in which they are set to operate with a view to providing practitioners with a‐priori understanding of which techniques are likely to perform better under any given particular set of circumstances. A multinomial model is used to begin to investigate the interaction between imputation implementations, and the role that context might play in the accuracy of their imputations.
Text
A comparison of approaches for implementation of k-nearest neighbour imputation for missing items in cross-national, time series data sets of economic indicators
- Version of Record
More information
Published date: 2018
Identifiers
Local EPrints ID: 422171
URI: http://eprints.soton.ac.uk/id/eprint/422171
PURE UUID: da38f0d5-8302-45e7-ad0b-9b44255297d1
Catalogue record
Date deposited: 18 Jul 2018 16:30
Last modified: 16 Mar 2024 03:23
Export record
Contributors
Author:
Ben Ross Mason
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics