The University of Southampton
University of Southampton Institutional Repository

Statistical analysis of data from experiments subject to restricted randomisation

Statistical analysis of data from experiments subject to restricted randomisation
Statistical analysis of data from experiments subject to restricted randomisation
The selection of the best subset of variables, which will have a strong effect on an outcome of interest, is fundamental when avoiding overfitting in statistical modelling. However, when there are many variables, it is computationally difficult to find this best subset. The difficulties of variable selection would be more complex when designs are with restricted randomisation. This work aims to fill the gap of variable selection and model estimation for data from experiments subject to restricted randomisation by developing new methods for variable selection and model estimation using frequentist analysis and Bayesian analysis for experiments subject to restricted randomisation. Frequentist and Bayesian analysis methods are used to carry out a comparative study with respect to their performance in variable selection and model estimation. As a representative of frequentist analysis, the Penalised Generalised Least Square (PGLS) estimator is used in which a single shrinkage parameter is applied to all regression effects. Furthermore, as two different strata in split-plot design are existed, the PGLS approach is extended to perform variable selection and model estimation simultaneously in the context of split-plot design. The Penalised Generalised Least Squares for Split-Plot Design estimator (PGLS-SPD) is utilized, in which two shrinkage parameters are applied, one for the subplot effects and the other for the whole-plot effects. As a representative of Bayesian analysis, the Stochastic Search Variable Selection (SSVS) technique is used. This performs variable selection and model estimation simultaneously where the variance of all active factors will be sampled from one posterior distribution. As two different strata in split-plot design are existed, the SSVS approach to perform Bayesian variable selection is extended for the analysis of data from restricted randomised experiments by introducing the Stochastic Search Variable Selection for Split-Plot Design (SSVS-SPD) in which the variances of the active subplot and whole-plot factors are sampled from two different posterior distributions. The usefulness of frequentist and Bayesian approaches are demonstrated using two practical examples, and their properties are studied in simulation studies. The result of the comparative study of frequentist analysis and Bayesian analysis supports the utilization of SSVS-SPD method for the statistical analysis of data from experiments subject to restricted randomisation.s computationally difficult to find this best subset. The difficulties of variable selection would be more complex when designs are with restricted randomisation. This work aims to fill the gap of variable selection and model estimation for data from experiments subject to restricted randomisation by developing new methods for variable selection and model estimation using frequentist analysis and Bayesian analysis for experiments subject to restricted randomisation. Frequentist and Bayesian analysis methods are used to carry out a comparative study with respect to their performance in variable selection and model estimation. As a representative of frequentist analysis, the Penalised Generalised Least Square (PGLS) estimator is used in which a single shrinkage parameter is applied to all regression effects. Furthermore, as two different strata in split-plot design are existed, the PGLS approach is extended to perform variable selection and model estimation simultaneously in the context of split-plot design. The Penalised Generalised Least Squares for Split-Plot Design estimator (PGLS-SPD) is utilized, in which two shrinkage parameters are applied, one for the subplot effects and the other for the whole-plot effects. As a representative of Bayesian analysis, the Stochastic Search Variable Selection (SSVS) technique is used. This performs variable selection and model estimation simultaneously where the variance of all active factors will be sampled from one posterior distribution. As two different strata in split-plot design are existed, the SSVS approach to perform Bayesian variable selection is extended for the analysis of data from restricted randomised experiments by introducing the Stochastic Search Variable Selection for Split-Plot Design (SSVS-SPD) in which the variances of the active subplot and whole-plot factors are sampled from two different posterior distributions. The usefulness of frequentist and Bayesian approaches are demonstrated using two practical examples, and their properties are studied in simulation studies. The result of the comparative study of frequentist analysis and Bayesian analysis supports the utilization of SSVS-SPD method for the statistical analysis of data from experiments subject to restricted randomisation.
University of Southampton
Aljeddani, Sadiah
1de8477d-35ab-4a3d-ab01-6a5ad6891dc2
Aljeddani, Sadiah
1de8477d-35ab-4a3d-ab01-6a5ad6891dc2
Mylona, Kalliopi
b44af287-2d9f-4df8-931c-32d8ab117864

Aljeddani, Sadiah (2018) Statistical analysis of data from experiments subject to restricted randomisation. University of Southampton, Doctoral Thesis, 175pp.

Record type: Thesis (Doctoral)

Abstract

The selection of the best subset of variables, which will have a strong effect on an outcome of interest, is fundamental when avoiding overfitting in statistical modelling. However, when there are many variables, it is computationally difficult to find this best subset. The difficulties of variable selection would be more complex when designs are with restricted randomisation. This work aims to fill the gap of variable selection and model estimation for data from experiments subject to restricted randomisation by developing new methods for variable selection and model estimation using frequentist analysis and Bayesian analysis for experiments subject to restricted randomisation. Frequentist and Bayesian analysis methods are used to carry out a comparative study with respect to their performance in variable selection and model estimation. As a representative of frequentist analysis, the Penalised Generalised Least Square (PGLS) estimator is used in which a single shrinkage parameter is applied to all regression effects. Furthermore, as two different strata in split-plot design are existed, the PGLS approach is extended to perform variable selection and model estimation simultaneously in the context of split-plot design. The Penalised Generalised Least Squares for Split-Plot Design estimator (PGLS-SPD) is utilized, in which two shrinkage parameters are applied, one for the subplot effects and the other for the whole-plot effects. As a representative of Bayesian analysis, the Stochastic Search Variable Selection (SSVS) technique is used. This performs variable selection and model estimation simultaneously where the variance of all active factors will be sampled from one posterior distribution. As two different strata in split-plot design are existed, the SSVS approach to perform Bayesian variable selection is extended for the analysis of data from restricted randomised experiments by introducing the Stochastic Search Variable Selection for Split-Plot Design (SSVS-SPD) in which the variances of the active subplot and whole-plot factors are sampled from two different posterior distributions. The usefulness of frequentist and Bayesian approaches are demonstrated using two practical examples, and their properties are studied in simulation studies. The result of the comparative study of frequentist analysis and Bayesian analysis supports the utilization of SSVS-SPD method for the statistical analysis of data from experiments subject to restricted randomisation.s computationally difficult to find this best subset. The difficulties of variable selection would be more complex when designs are with restricted randomisation. This work aims to fill the gap of variable selection and model estimation for data from experiments subject to restricted randomisation by developing new methods for variable selection and model estimation using frequentist analysis and Bayesian analysis for experiments subject to restricted randomisation. Frequentist and Bayesian analysis methods are used to carry out a comparative study with respect to their performance in variable selection and model estimation. As a representative of frequentist analysis, the Penalised Generalised Least Square (PGLS) estimator is used in which a single shrinkage parameter is applied to all regression effects. Furthermore, as two different strata in split-plot design are existed, the PGLS approach is extended to perform variable selection and model estimation simultaneously in the context of split-plot design. The Penalised Generalised Least Squares for Split-Plot Design estimator (PGLS-SPD) is utilized, in which two shrinkage parameters are applied, one for the subplot effects and the other for the whole-plot effects. As a representative of Bayesian analysis, the Stochastic Search Variable Selection (SSVS) technique is used. This performs variable selection and model estimation simultaneously where the variance of all active factors will be sampled from one posterior distribution. As two different strata in split-plot design are existed, the SSVS approach to perform Bayesian variable selection is extended for the analysis of data from restricted randomised experiments by introducing the Stochastic Search Variable Selection for Split-Plot Design (SSVS-SPD) in which the variances of the active subplot and whole-plot factors are sampled from two different posterior distributions. The usefulness of frequentist and Bayesian approaches are demonstrated using two practical examples, and their properties are studied in simulation studies. The result of the comparative study of frequentist analysis and Bayesian analysis supports the utilization of SSVS-SPD method for the statistical analysis of data from experiments subject to restricted randomisation.

Text
Sadiah Aljeddani - Version of Record
Available under License University of Southampton Thesis Licence.
Download (5MB)

More information

Published date: July 2018

Identifiers

Local EPrints ID: 428057
URI: http://eprints.soton.ac.uk/id/eprint/428057
PURE UUID: 3a62873d-440c-4fad-8ec5-e01c7961cbaf

Catalogue record

Date deposited: 07 Feb 2019 17:30
Last modified: 16 Mar 2024 00:02

Export record

Contributors

Author: Sadiah Aljeddani
Thesis advisor: Kalliopi Mylona

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×