The University of Southampton
University of Southampton Institutional Repository

A combined iterative sure independence screening and Cox proportional hazard model for extracting and analyzing prognostic biomarkers of adenocarcinoma lung cancer

A combined iterative sure independence screening and Cox proportional hazard model for extracting and analyzing prognostic biomarkers of adenocarcinoma lung cancer
A combined iterative sure independence screening and Cox proportional hazard model for extracting and analyzing prognostic biomarkers of adenocarcinoma lung cancer

The selection of significant biomarkers is essential in researching cancer diagnosis and treatment. The independence screening method works substantively to select crucial features based on the conditional marginal selection method. But it may pretermit the concoction effect of some marginally less essential covariates. We aim to obtain a significant biomarker-specific prediction on overall survival to know their survival and death risk. In this work, an iterative sure independence screening (ISIS) scheme has been applied to extract features from the high-dimensional dataset of adenocarcinoma lung cancer. Conventional and Bayesian approaches of the Cox proportional hazard (CPH) model have been used for analyzing the data to provide interpretation and conclusions about survival estimates. The accelerated failure time model is also used as an alternative to the CPH model. A forest plot is employed to show the graphical representation of the meta-analysis of the study design. Utilizing ISIS, we selected up to 20 relevant features From the entire dataset of adenocarcinoma lung cancer; some of them are liable to produce a positive hazard ratio greater than 1, and some are less than 1. The P values associated with the selected biomarkers imply their statistical significance. Fourteen biomarkers have been identified with a hazard ratio of less than 1; the remaining 20 biomarkers are greater than 1. These 14 biomarkers produce less risk of death for patients with adenocarcinoma lung cancer, and the remaining six biomarkers result in a high risk of death from adenocarcinoma lung cancer.

Bayesian, Cox proportional hazard model, Feature selection, Iterative sure independence screening, Lung cancer
Bhattacharjee, Atanu
3aed8d6e-c8aa-4bf4-8092-e4396f6243c5
Dey, Jishu
6fc81b71-20e4-49d5-946d-5f0d65b9f979
Kumari, Pragya
e118d0bb-a6dc-4e20-ba18-ea2527e46877
Bhattacharjee, Atanu
3aed8d6e-c8aa-4bf4-8092-e4396f6243c5
Dey, Jishu
6fc81b71-20e4-49d5-946d-5f0d65b9f979
Kumari, Pragya
e118d0bb-a6dc-4e20-ba18-ea2527e46877

Bhattacharjee, Atanu, Dey, Jishu and Kumari, Pragya (2022) A combined iterative sure independence screening and Cox proportional hazard model for extracting and analyzing prognostic biomarkers of adenocarcinoma lung cancer. Healthcare Analytics, 2, [100108]. (doi:10.1016/j.health.2022.100108).

Record type: Article

Abstract

The selection of significant biomarkers is essential in researching cancer diagnosis and treatment. The independence screening method works substantively to select crucial features based on the conditional marginal selection method. But it may pretermit the concoction effect of some marginally less essential covariates. We aim to obtain a significant biomarker-specific prediction on overall survival to know their survival and death risk. In this work, an iterative sure independence screening (ISIS) scheme has been applied to extract features from the high-dimensional dataset of adenocarcinoma lung cancer. Conventional and Bayesian approaches of the Cox proportional hazard (CPH) model have been used for analyzing the data to provide interpretation and conclusions about survival estimates. The accelerated failure time model is also used as an alternative to the CPH model. A forest plot is employed to show the graphical representation of the meta-analysis of the study design. Utilizing ISIS, we selected up to 20 relevant features From the entire dataset of adenocarcinoma lung cancer; some of them are liable to produce a positive hazard ratio greater than 1, and some are less than 1. The P values associated with the selected biomarkers imply their statistical significance. Fourteen biomarkers have been identified with a hazard ratio of less than 1; the remaining 20 biomarkers are greater than 1. These 14 biomarkers produce less risk of death for patients with adenocarcinoma lung cancer, and the remaining six biomarkers result in a high risk of death from adenocarcinoma lung cancer.

This record has no associated files available for download.

More information

Published date: 13 November 2022
Additional Information: Publisher Copyright: © 2022 The Author(s)
Keywords: Bayesian, Cox proportional hazard model, Feature selection, Iterative sure independence screening, Lung cancer

Identifiers

Local EPrints ID: 487639
URI: http://eprints.soton.ac.uk/id/eprint/487639
PURE UUID: d37c813e-fff9-4d00-9a63-6935930101b4
ORCID for Jishu Dey: ORCID iD orcid.org/0000-0002-0356-7454

Catalogue record

Date deposited: 29 Feb 2024 17:54
Last modified: 18 Mar 2024 04:14

Export record

Altmetrics

Contributors

Author: Atanu Bhattacharjee
Author: Jishu Dey ORCID iD
Author: Pragya Kumari

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×