Distributions of test statistics for edge exclusion for graphical models
Distributions of test statistics for edge exclusion for graphical models
Three test statistics for single edge exclusion from the saturated model are considered: the likelihood ratio test, the Wald test and the efficient score test. Non-signed and signed square-root versions are used. Their distributions are investigated, in particular under the alternative hypothesis that the saturated model holds. The delta-method is used to derive approximating asymptotic normal distributions. A non-central chi-square approximation is also proposed.
The power of the three test statistics for single edge exclusion is studied in detail, both for graphical Gaussian models with p variables and for graphical log-linear models with two and three binary variables. Theoretical asymptotic power functions are derived for the non-signed and the signed square-root versions of each test statistic. The normal and the non-central chi-square approximations, previously derived, are used. The quality of the approximation is assessed by simulation.
The single-factor model and the latent class model (with all variables binary) are parameterised, within the framework of graphical models, as a graphical Gaussian model and as a graphical log-linear model, respectively. The implications of such parameterisations are discussed, in particular concerning the parameter space and its admissible regions. It is proved that marginalising over the latent variable, both in a single-factor graphical Gaussian model and in a single-factor graphical log-linear model, induces no conditional independencies between manifest variables and, therefore, an independence graph that is complete. Consequently, starting with the saturated model and performing backwards elimination model selection is suggested as the most appropriate way for the data analyst to detect the presence of a latent variable. However, since model selection is subject to type II error, it is recommended that the power of the test statistics for single edge exclusion from the saturated model is taken into account.
A parallel is made between results obtained for graphical Gaussian models and for graphical log-linear models and some guidelines/recommendations are given to the data analyst.
University of Southampton
Salgueiro, Maria de Fatima Ramalho Fernandes
0ebdd79d-c3de-46bb-ae3f-65ef60359103
2002
Salgueiro, Maria de Fatima Ramalho Fernandes
0ebdd79d-c3de-46bb-ae3f-65ef60359103
Salgueiro, Maria de Fatima Ramalho Fernandes
(2002)
Distributions of test statistics for edge exclusion for graphical models.
University of Southampton, Doctoral Thesis.
Record type:
Thesis
(Doctoral)
Abstract
Three test statistics for single edge exclusion from the saturated model are considered: the likelihood ratio test, the Wald test and the efficient score test. Non-signed and signed square-root versions are used. Their distributions are investigated, in particular under the alternative hypothesis that the saturated model holds. The delta-method is used to derive approximating asymptotic normal distributions. A non-central chi-square approximation is also proposed.
The power of the three test statistics for single edge exclusion is studied in detail, both for graphical Gaussian models with p variables and for graphical log-linear models with two and three binary variables. Theoretical asymptotic power functions are derived for the non-signed and the signed square-root versions of each test statistic. The normal and the non-central chi-square approximations, previously derived, are used. The quality of the approximation is assessed by simulation.
The single-factor model and the latent class model (with all variables binary) are parameterised, within the framework of graphical models, as a graphical Gaussian model and as a graphical log-linear model, respectively. The implications of such parameterisations are discussed, in particular concerning the parameter space and its admissible regions. It is proved that marginalising over the latent variable, both in a single-factor graphical Gaussian model and in a single-factor graphical log-linear model, induces no conditional independencies between manifest variables and, therefore, an independence graph that is complete. Consequently, starting with the saturated model and performing backwards elimination model selection is suggested as the most appropriate way for the data analyst to detect the presence of a latent variable. However, since model selection is subject to type II error, it is recommended that the power of the test statistics for single edge exclusion from the saturated model is taken into account.
A parallel is made between results obtained for graphical Gaussian models and for graphical log-linear models and some guidelines/recommendations are given to the data analyst.
Text
882835.pdf
- Version of Record
More information
Published date: 2002
Identifiers
Local EPrints ID: 464832
URI: http://eprints.soton.ac.uk/id/eprint/464832
PURE UUID: 4861778f-3fae-41ae-b07b-c9cf95cbd9cd
Catalogue record
Date deposited: 05 Jul 2022 00:04
Last modified: 16 Mar 2024 19:46
Export record
Contributors
Author:
Maria de Fatima Ramalho Fernandes Salgueiro
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics