On the advantages of the non-concave penalized likelihood model selection method with minimum prediction errors in large-scale medical studies
On the advantages of the non-concave penalized likelihood model selection method with minimum prediction errors in large-scale medical studies
Variable and model selection problems are fundamental to high-dimensional statistical modeling in diverse fields of sciences. Especially in health studies, many potential factors are usually introduced to determine an outcome variable. This paper deals with the problem of high-dimensional statistical modeling through the analysis of the trauma annual data in Greece for 2005. The data set is divided into the experiment and control sets and consists of 6334 observations and 112 factors that include demographic, transport and intrahospital data used to detect possible risk factors of death. In our study, different model selection techniques are applied to the experiment set and the notion of deviance is used on the control set to assess the fit of the overall selected model. The statistical methods employed in this work were the non-concave penalized likelihood methods, smoothly clipped absolute deviation, least absolute shrinkage and selection operator, and Hard, the generalized linear logistic regression, and the best subset variable selection.The way of identifying the significant variables in large medical data sets along with the performance and the pros and cons of the various statistical techniques used are discussed. The performed analysis reveals the distinct advantages of the non-concave penalized likelihood methods over the traditional model selection techniques.
model selection, generalized linear model, non-concave penalized likelihood, high-dimensional data set, deviance, trauma
13-24
Karagrigoriou, A.
86c54998-ffee-4907-96e3-6b83d176610d
Koukouvinos, C.
3c626a53-575f-4c62-9b9a-a949f717764b
Mylona, K.
b44af287-2d9f-4df8-931c-32d8ab117864
2010
Karagrigoriou, A.
86c54998-ffee-4907-96e3-6b83d176610d
Koukouvinos, C.
3c626a53-575f-4c62-9b9a-a949f717764b
Mylona, K.
b44af287-2d9f-4df8-931c-32d8ab117864
Karagrigoriou, A., Koukouvinos, C. and Mylona, K.
(2010)
On the advantages of the non-concave penalized likelihood model selection method with minimum prediction errors in large-scale medical studies.
Journal of Applied Statistics, 37 (1), .
(doi:10.1080/02664760802638116).
Abstract
Variable and model selection problems are fundamental to high-dimensional statistical modeling in diverse fields of sciences. Especially in health studies, many potential factors are usually introduced to determine an outcome variable. This paper deals with the problem of high-dimensional statistical modeling through the analysis of the trauma annual data in Greece for 2005. The data set is divided into the experiment and control sets and consists of 6334 observations and 112 factors that include demographic, transport and intrahospital data used to detect possible risk factors of death. In our study, different model selection techniques are applied to the experiment set and the notion of deviance is used on the control set to assess the fit of the overall selected model. The statistical methods employed in this work were the non-concave penalized likelihood methods, smoothly clipped absolute deviation, least absolute shrinkage and selection operator, and Hard, the generalized linear logistic regression, and the best subset variable selection.The way of identifying the significant variables in large medical data sets along with the performance and the pros and cons of the various statistical techniques used are discussed. The performed analysis reveals the distinct advantages of the non-concave penalized likelihood methods over the traditional model selection techniques.
This record has no associated files available for download.
More information
e-pub ahead of print date: 15 December 2009
Published date: 2010
Keywords:
model selection, generalized linear model, non-concave penalized likelihood, high-dimensional data set, deviance, trauma
Organisations:
Statistics
Identifiers
Local EPrints ID: 336771
URI: http://eprints.soton.ac.uk/id/eprint/336771
ISSN: 0266-4763
PURE UUID: ee05d7d7-4cc5-4517-a3ba-860ff984189f
Catalogue record
Date deposited: 04 Apr 2012 15:39
Last modified: 14 Mar 2024 10:46
Export record
Altmetrics
Contributors
Author:
A. Karagrigoriou
Author:
C. Koukouvinos
Author:
K. Mylona
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics