The University of Southampton
University of Southampton Institutional Repository

Biomarker Discovery for Metabolic Dysfunction-associated Steatotic Liver Disease Utilizing Mendelian Randomization, Machine Learning, and External Validation

Biomarker Discovery for Metabolic Dysfunction-associated Steatotic Liver Disease Utilizing Mendelian Randomization, Machine Learning, and External Validation
Biomarker Discovery for Metabolic Dysfunction-associated Steatotic Liver Disease Utilizing Mendelian Randomization, Machine Learning, and External Validation

Background and Aims: The causal biomarkers for metabolic dysfunction-associated steatotic liver disease (MASLD) and their clinical value remain unclear. In this study, we aimed to identify biomarkers for MASLD and evaluate their diagnostic and prognostic significance. Methods: We conducted a Mendelian randomization analysis to assess the causal effects of 2,925 molecular biomarkers (from proteomics data) and 35 clinical biomarkers on MASLD. Mediation analysis was performed to determine whether clinical biomarkers mediated the effects of molecular biomarkers. The association between key clinical biomarkers and MASLD was externally validated in a hospital-based cohort (n = 415). A machine learning–based diagnostic model for MASLD was developed and validated using the identified molecular biomarkers. Prognostic significance was evaluated for both molecular and clinical biomarkers. Results: Six molecular biomarkers—in-cluding canopy FGF signaling regulator 4 (CNPY4), ectonu-cleoside triphosphate diphosphohydrolase 6 (ENTPD6), and major histocompatibility complex, class I, A (HLA-A)—and eight clinical biomarkers (e.g., serum total protein (STP)) were identified as causally related to MASLD. STP partially mediated the effect of HLA-A on MASLD (23.61%) and was associated with MASLD in the external cohort (odds ratio = 1.080, 95% confidence interval: 1.011–1.155). A random forest model demonstrated high diagnostic performance (AUC = 0.941 in training; 0.875 in validation). High expres-sion levels of CNPY4 and ENTPD6 were associated with the development of and poorer survival from hepatocellular car-cinoma. Low STP (<60 g/L) predicted all-cause mortality (HR = 2.50, 95% confidence interval: 1.22–5.09). Conclusions: This study identifies six causal molecular biomarkers (e.g., CNPY4, ENTPD6, HLA-A) and eight clinical biomarkers for MASLD. Notably, STP mediates the effect of HLA-A on MASLD and is associated with all-cause mortality.

Causal biomarkers, Machine learning, Mediation analysis, Mendelian randomization, Metabolic dysfunction-associated fatty liver disease, Non-invasive diagnosis, Prognosis, Proteomics
2225-0719
723-733
Feng, Gong
01a55a38-0227-43aa-8eba-6ae8ca4665e0
Targher, Giovanni
bd646ac5-baa1-4944-8050-0c001b559283
Byrne, Chrisopher D.
1370b997-cead-4229-83a7-53301ed2a43c
He, Na
d3bd1563-d091-4746-8b84-6901304f87c1
Mi, Man
cf5d1581-a047-4db2-8e6d-cfd64154ceda
Liu, Yi
90fd6b8c-b22a-4910-ae5d-775f137193f1
Zhu, Hongbin
4506ac02-3d92-4233-96bb-9683bea4c8f2
Zheng, Ming-Hua
3a497ced-7e64-4f40-b442-702cc802c310
Ye, Feng
4e4bba72-e531-4131-9974-adfb3d090170
Feng, Gong
01a55a38-0227-43aa-8eba-6ae8ca4665e0
Targher, Giovanni
bd646ac5-baa1-4944-8050-0c001b559283
Byrne, Chrisopher D.
1370b997-cead-4229-83a7-53301ed2a43c
He, Na
d3bd1563-d091-4746-8b84-6901304f87c1
Mi, Man
cf5d1581-a047-4db2-8e6d-cfd64154ceda
Liu, Yi
90fd6b8c-b22a-4910-ae5d-775f137193f1
Zhu, Hongbin
4506ac02-3d92-4233-96bb-9683bea4c8f2
Zheng, Ming-Hua
3a497ced-7e64-4f40-b442-702cc802c310
Ye, Feng
4e4bba72-e531-4131-9974-adfb3d090170

Feng, Gong, Targher, Giovanni, Byrne, Chrisopher D., He, Na, Mi, Man, Liu, Yi, Zhu, Hongbin, Zheng, Ming-Hua and Ye, Feng (2025) Biomarker Discovery for Metabolic Dysfunction-associated Steatotic Liver Disease Utilizing Mendelian Randomization, Machine Learning, and External Validation. Journal of Clinical and Translational Hepatology, 13 (9), 723-733. (doi:10.14218/JCTH.2025.00270).

Record type: Article

Abstract

Background and Aims: The causal biomarkers for metabolic dysfunction-associated steatotic liver disease (MASLD) and their clinical value remain unclear. In this study, we aimed to identify biomarkers for MASLD and evaluate their diagnostic and prognostic significance. Methods: We conducted a Mendelian randomization analysis to assess the causal effects of 2,925 molecular biomarkers (from proteomics data) and 35 clinical biomarkers on MASLD. Mediation analysis was performed to determine whether clinical biomarkers mediated the effects of molecular biomarkers. The association between key clinical biomarkers and MASLD was externally validated in a hospital-based cohort (n = 415). A machine learning–based diagnostic model for MASLD was developed and validated using the identified molecular biomarkers. Prognostic significance was evaluated for both molecular and clinical biomarkers. Results: Six molecular biomarkers—in-cluding canopy FGF signaling regulator 4 (CNPY4), ectonu-cleoside triphosphate diphosphohydrolase 6 (ENTPD6), and major histocompatibility complex, class I, A (HLA-A)—and eight clinical biomarkers (e.g., serum total protein (STP)) were identified as causally related to MASLD. STP partially mediated the effect of HLA-A on MASLD (23.61%) and was associated with MASLD in the external cohort (odds ratio = 1.080, 95% confidence interval: 1.011–1.155). A random forest model demonstrated high diagnostic performance (AUC = 0.941 in training; 0.875 in validation). High expres-sion levels of CNPY4 and ENTPD6 were associated with the development of and poorer survival from hepatocellular car-cinoma. Low STP (<60 g/L) predicted all-cause mortality (HR = 2.50, 95% confidence interval: 1.22–5.09). Conclusions: This study identifies six causal molecular biomarkers (e.g., CNPY4, ENTPD6, HLA-A) and eight clinical biomarkers for MASLD. Notably, STP mediates the effect of HLA-A on MASLD and is associated with all-cause mortality.

Text
R1_MR Identifies Biomarkers for MASLD_JCTH20250622 - Accepted Manuscript
Download (171kB)
Text
Supplementary Material_20250622 - Accepted Manuscript
Download (3MB)
Text
Figure 1 - Accepted Manuscript
Download (301kB)
Text
Figure 2 - Accepted Manuscript
Download (2MB)
Text
Figure 3 - Accepted Manuscript
Download (787kB)
Text
Figure 4 - Accepted Manuscript
Download (770kB)
Text
Figure 5 - Accepted Manuscript
Download (683kB)
Text
Figure 6 - Accepted Manuscript
Download (60kB)
Text
Graphical abstract - Accepted Manuscript
Download (301kB)
Text
JCTH.2025.0(0).0.00270.Gong Feng - Version of Record
Download (3MB)

Show all 10 downloads.

More information

Accepted/In Press date: 2 July 2025
e-pub ahead of print date: 16 July 2025
Published date: 16 July 2025
Additional Information: Publisher Copyright: © 2025 The Author(s).
Keywords: Causal biomarkers, Machine learning, Mediation analysis, Mendelian randomization, Metabolic dysfunction-associated fatty liver disease, Non-invasive diagnosis, Prognosis, Proteomics

Identifiers

Local EPrints ID: 503947
URI: http://eprints.soton.ac.uk/id/eprint/503947
ISSN: 2225-0719
PURE UUID: dd0e1136-05fd-42a6-a62f-fb460f961e84
ORCID for Chrisopher D. Byrne: ORCID iD orcid.org/0000-0001-6322-7753

Catalogue record

Date deposited: 19 Aug 2025 16:31
Last modified: 30 Sep 2025 01:40

Export record

Altmetrics

Contributors

Author: Gong Feng
Author: Giovanni Targher
Author: Na He
Author: Man Mi
Author: Yi Liu
Author: Hongbin Zhu
Author: Ming-Hua Zheng
Author: Feng Ye

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×