Optimal design of experiments for MNAR data
Optimal design of experiments for MNAR data
The presence of missing data leads to biases in data analyses. To overcome these biases, it is crucial to understand the type of missing data that is present in the data. Amongst the three types of missing data (known as missing data mechanisms) that will be formally introduced in this thesis, the Missing Not At Random (MNAR) mechanism is the most complex. MNAR poses the most difficulties as it is an untestable assumption based on the current incomplete data. A recovery of some of the missing data is required to test its presence. In this research, we developed two statistical tests for testing the presence of MNAR in datasets and provide the theoretical framework of the tests. In the first test, the recovery design consists of a random sampling of the responses whose covariates lie within a particular region while the second test is based on an assignment of probabilities. We introduced techniques from Design of Experiments to improve the properties of these tests. The developed tests are compared with a random follow-up of missing responses, which will act as our benchmark design throughout. We formulate an easy and simple conjecture that uses the empirical density of the covariates to obtain the recovery region. Through simulations, the performance of the
tests is evaluated.
Keywords: Missing data; Missing not at random; Selection model; Recovery region;
Conjecture.
University of Southampton
Adediran, Adetola Adedamola
9a2a4e93-d4d5-4e29-a264-91055bb8b0c7
2025
Adediran, Adetola Adedamola
9a2a4e93-d4d5-4e29-a264-91055bb8b0c7
Overstall, Antony
c1d6c8bd-1c5f-49ee-a845-ec9ec7b20910
Böhning, Dankmar
1df635d4-e3dc-44d0-b61d-5fd11f6434e1
Biedermann, Stefanie
fe3027d2-13c3-4d9a-bfef-bcc7c6415039
Adediran, Adetola Adedamola
(2025)
Optimal design of experiments for MNAR data.
University of Southampton, Doctoral Thesis, 159pp.
Record type:
Thesis
(Doctoral)
Abstract
The presence of missing data leads to biases in data analyses. To overcome these biases, it is crucial to understand the type of missing data that is present in the data. Amongst the three types of missing data (known as missing data mechanisms) that will be formally introduced in this thesis, the Missing Not At Random (MNAR) mechanism is the most complex. MNAR poses the most difficulties as it is an untestable assumption based on the current incomplete data. A recovery of some of the missing data is required to test its presence. In this research, we developed two statistical tests for testing the presence of MNAR in datasets and provide the theoretical framework of the tests. In the first test, the recovery design consists of a random sampling of the responses whose covariates lie within a particular region while the second test is based on an assignment of probabilities. We introduced techniques from Design of Experiments to improve the properties of these tests. The developed tests are compared with a random follow-up of missing responses, which will act as our benchmark design throughout. We formulate an easy and simple conjecture that uses the empirical density of the covariates to obtain the recovery region. Through simulations, the performance of the
tests is evaluated.
Keywords: Missing data; Missing not at random; Selection model; Recovery region;
Conjecture.
Text
ADETOLA ADEDIRAN THESIS
- Version of Record
Text
Final-thesis-submission-Examination-Miss-Adetola-Adediran (1)
Restricted to Repository staff only
More information
Published date: 2025
Identifiers
Local EPrints ID: 497272
URI: http://eprints.soton.ac.uk/id/eprint/497272
PURE UUID: cab4c28f-cf21-4ddf-b1fc-b02c0da9625a
Catalogue record
Date deposited: 17 Jan 2025 17:32
Last modified: 08 Feb 2025 02:46
Export record
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics