The University of Southampton
University of Southampton Institutional Repository

Machine learning algorithm outperforms fibrosis markers in predicting significant fibrosis in biopsy-confirmed NAFLD

Machine learning algorithm outperforms fibrosis markers in predicting significant fibrosis in biopsy-confirmed NAFLD
Machine learning algorithm outperforms fibrosis markers in predicting significant fibrosis in biopsy-confirmed NAFLD

Background: The presence of significant liver fibrosis is a key determinant of long-term prognosis in non-alcoholic fatty liver disease (NAFLD). We aimed to develop a novel machine learning algorithm (MLA) to predict fibrosis severity in NAFLD and compared it with the most widely used non-invasive fibrosis biomarkers. Methods: We used a cohort of 553 adults with biopsy-proven NAFLD, who were randomly divided into a training cohort (n = 278) for the development of both logistic regression model (LRM) and MLA, and a validation cohort (n = 275). Significant fibrosis was defined as fibrosis stage F ≥ 2. MLA and LRM were derived from variables that were selected using a least absolute shrinkage and selection operator (LASSO) logistic regression algorithm. Results: In the training cohort, the variables selected by LASSO algorithm were body mass index, pro-collagen type III, collagen type IV, aspartate aminotransferase and albumin-to-globulin ratio. The diagnostic accuracy of MLA showed the highest values of area under the receiver operator characteristic curve (AUROC: 0.902, 95% CI 0.869-0.904) for identifying fibrosis F ≥ 2. The LRM AUROC was 0.764, 95% CI 0.710-0.816 and significantly better than the AST-to-Platelet ratio (AUROC 0.684, 95% CI 0.605-0.762), FIB-4 score (AUROC 0.594, 95% CI 0.503-0.685) and NAFLD Fibrosis Score (AUROC 0.557, 95% CI 0.470-0.644). In the validation cohort, MLA also showed the highest AUROC (0.893, 95% CI 0.864-0.901). The diagnostic accuracy of MLA outperformed that of LRM in all subgroups considered. Conclusions: Our newly developed MLA algorithm has excellent diagnostic performance for predicting fibrosis F ≥ 2 in patients with biopsy-confirmed NAFLD.

NAFLD, diagnosis, fibrosis, liver biopsy, machine learning algorithm
593-603
Feng, Gong
8e56c77e-8da1-48f2-9560-24bc08ec0a57
Zheng, Kenneth I.
ebf42168-f891-4f68-8092-57320757c78e
Li, Yang-Yang
f7557249-d98b-4b57-91d0-1a2aca534bd3
Rios, Rafael S.
f332d7b7-ddff-4c75-a643-37fd5cd9c845
Zhu, Pei-Wu
cb200d5e-a53e-4901-906d-82d888b4fa5f
Pan, Xiao-Yan
27971dcf-4145-406d-ac81-3b2a71492d7a
Li, Gang
c0d5b9ec-2e7e-4748-95c5-23a01e213a2a
Ma, Hong-Lei
1e7d12e2-44d0-48f2-9516-f4b5bf4a3c79
Tang, Liang-Jie
36b218de-aa06-4c56-88d9-df5f6073e351
Byrne, Christopher
1370b997-cead-4229-83a7-53301ed2a43c
Targher, Giovanni
043e0811-b389-4922-974e-22e650212c5f
He, Na
8f8d16d2-1d22-4aca-911b-0cc231806003
Mi, Man
c03ec0a9-0407-4157-adb3-5db11b378fa9
Chen, Yong-Ping
c7164a42-06de-49fb-962f-ce458f736cdd
Zheng, Ming-Hua
5dcac971-00e3-4210-a1b0-a712b53eb5e8
Feng, Gong
8e56c77e-8da1-48f2-9560-24bc08ec0a57
Zheng, Kenneth I.
ebf42168-f891-4f68-8092-57320757c78e
Li, Yang-Yang
f7557249-d98b-4b57-91d0-1a2aca534bd3
Rios, Rafael S.
f332d7b7-ddff-4c75-a643-37fd5cd9c845
Zhu, Pei-Wu
cb200d5e-a53e-4901-906d-82d888b4fa5f
Pan, Xiao-Yan
27971dcf-4145-406d-ac81-3b2a71492d7a
Li, Gang
c0d5b9ec-2e7e-4748-95c5-23a01e213a2a
Ma, Hong-Lei
1e7d12e2-44d0-48f2-9516-f4b5bf4a3c79
Tang, Liang-Jie
36b218de-aa06-4c56-88d9-df5f6073e351
Byrne, Christopher
1370b997-cead-4229-83a7-53301ed2a43c
Targher, Giovanni
043e0811-b389-4922-974e-22e650212c5f
He, Na
8f8d16d2-1d22-4aca-911b-0cc231806003
Mi, Man
c03ec0a9-0407-4157-adb3-5db11b378fa9
Chen, Yong-Ping
c7164a42-06de-49fb-962f-ce458f736cdd
Zheng, Ming-Hua
5dcac971-00e3-4210-a1b0-a712b53eb5e8

Feng, Gong, Zheng, Kenneth I., Li, Yang-Yang, Rios, Rafael S., Zhu, Pei-Wu, Pan, Xiao-Yan, Li, Gang, Ma, Hong-Lei, Tang, Liang-Jie, Byrne, Christopher, Targher, Giovanni, He, Na, Mi, Man, Chen, Yong-Ping and Zheng, Ming-Hua (2021) Machine learning algorithm outperforms fibrosis markers in predicting significant fibrosis in biopsy-confirmed NAFLD. Journal of Hepato-Biliary-Pancreatic Sciences, 28 (7), 593-603. (doi:10.1002/jhbp.972).

Record type: Article

Abstract

Background: The presence of significant liver fibrosis is a key determinant of long-term prognosis in non-alcoholic fatty liver disease (NAFLD). We aimed to develop a novel machine learning algorithm (MLA) to predict fibrosis severity in NAFLD and compared it with the most widely used non-invasive fibrosis biomarkers. Methods: We used a cohort of 553 adults with biopsy-proven NAFLD, who were randomly divided into a training cohort (n = 278) for the development of both logistic regression model (LRM) and MLA, and a validation cohort (n = 275). Significant fibrosis was defined as fibrosis stage F ≥ 2. MLA and LRM were derived from variables that were selected using a least absolute shrinkage and selection operator (LASSO) logistic regression algorithm. Results: In the training cohort, the variables selected by LASSO algorithm were body mass index, pro-collagen type III, collagen type IV, aspartate aminotransferase and albumin-to-globulin ratio. The diagnostic accuracy of MLA showed the highest values of area under the receiver operator characteristic curve (AUROC: 0.902, 95% CI 0.869-0.904) for identifying fibrosis F ≥ 2. The LRM AUROC was 0.764, 95% CI 0.710-0.816 and significantly better than the AST-to-Platelet ratio (AUROC 0.684, 95% CI 0.605-0.762), FIB-4 score (AUROC 0.594, 95% CI 0.503-0.685) and NAFLD Fibrosis Score (AUROC 0.557, 95% CI 0.470-0.644). In the validation cohort, MLA also showed the highest AUROC (0.893, 95% CI 0.864-0.901). The diagnostic accuracy of MLA outperformed that of LRM in all subgroups considered. Conclusions: Our newly developed MLA algorithm has excellent diagnostic performance for predicting fibrosis F ≥ 2 in patients with biopsy-confirmed NAFLD.

Text
Journal of Hepato-Biliary-Pancreatic Sciences - Accepted Manuscript
Download (264kB)
Text
Hepato-Biliary_Pancreatic Sciences Table 1 - Accepted Manuscript
Download (176kB)
Text
Hepato-Biliary-Pancreatic Sciences Table 2 - Accepted Manuscript
Download (157kB)
Text
Hepato-Biliary-Pancreatic Sciences Table 3 - Accepted Manuscript
Restricted to Repository staff only
Request a copy
Text
Hepato-Biliary-Pancreatic Sciences_Table 4 - Accepted Manuscript
Restricted to Repository staff only
Request a copy
Image
Figure 1 - Accepted Manuscript
Restricted to Repository staff only
Request a copy
Image
Figure 2 - Accepted Manuscript
Restricted to Repository staff only
Request a copy
Image
Figure 3 - Accepted Manuscript
Restricted to Repository staff only
Request a copy
Image
Supplementary Figure 1 - Accepted Manuscript
Restricted to Repository staff only
Request a copy
Image
Supplementary Figure 2 - Accepted Manuscript
Restricted to Repository staff only
Request a copy
Image
Supplementary Figure 3 - Accepted Manuscript
Restricted to Repository staff only
Request a copy
Image
Supplementary Figure 4 - Accepted Manuscript
Restricted to Repository staff only
Request a copy

Show all 12 downloads.

More information

Accepted/In Press date: 2 April 2021
e-pub ahead of print date: 28 April 2021
Published date: July 2021
Additional Information: Funding Information: This work was supported by grants from the National Natural Science Foundation of China (82070588), High Level Creative Talents from Department of Public Health in Zhejiang Province (S2032102600032) and Project of New Century 551 Talent Nurturing in Wenzhou. Na He is supported in part by Special Research Fund of Youan Medical Alliance for the Liver and Infectious Diseases (LM202003). GT is supported in part by grants from the School of Medicine, University of Verona, Verona, Italy. CDB is supported in part by the Southampton NIHR Biomedical Research Centre (IS-BRC-20004), UK. We thank Prof. Ji-Min Liu, a liver pathologist from McMaster University, who conducted quality control of liver pathology data. Funding Information: This work was supported by grants from the National Natural Science Foundation of China (82070588), High Level Creative Talents from Department of Public Health in Zhejiang Province (S2032102600032) and Project of New Century 551 Talent Nurturing in Wenzhou. Na He is supported in part by Special Research Fund of Youan Medical Alliance for the Liver and Infectious Diseases (LM202003). GT is supported in part by grants from the School of Medicine, University of Verona, Verona, Italy. CDB is supported in part by the Southampton NIHR Biomedical Research Centre (IS‐BRC‐20004), UK. Publisher Copyright: © 2021 Japanese Society of Hepato-Biliary-Pancreatic Surgery
Keywords: NAFLD, diagnosis, fibrosis, liver biopsy, machine learning algorithm

Identifiers

Local EPrints ID: 448409
URI: http://eprints.soton.ac.uk/id/eprint/448409
PURE UUID: 9913f255-293b-4d1b-9cd9-5110595fc285
ORCID for Christopher Byrne: ORCID iD orcid.org/0000-0001-6322-7753

Catalogue record

Date deposited: 21 Apr 2021 16:35
Last modified: 17 Mar 2024 06:29

Export record

Altmetrics

Contributors

Author: Gong Feng
Author: Kenneth I. Zheng
Author: Yang-Yang Li
Author: Rafael S. Rios
Author: Pei-Wu Zhu
Author: Xiao-Yan Pan
Author: Gang Li
Author: Hong-Lei Ma
Author: Liang-Jie Tang
Author: Giovanni Targher
Author: Na He
Author: Man Mi
Author: Yong-Ping Chen
Author: Ming-Hua Zheng

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×