A systematic review of artificial intelligence and machine learning applications to inflammatory bowel disease, with practical guidelines for interpretation
A systematic review of artificial intelligence and machine learning applications to inflammatory bowel disease, with practical guidelines for interpretation
Background: Inflammatory bowel disease (IBD) is a gastrointestinal chronic disease with an unpredictable disease course. Computational methods such as machine learning (ML) have the potential to stratify IBD patients for the provision of individualised care. The use of ML methods for IBD was surveyed, with an additional focus on how the field has changed over time.
Methods: A systematic review was conducted through a search of MEDLINE and Embase databases, with the search structure (“machine learning” OR “artificial intelligence”) AND (“Crohn* Disease” OR “Ulcerative Colitis” OR “Inflammatory Bowel Disease”), searched 6th May 2021. Exclusion criteria: studies not written in English, no human patient data, publication before 2001, studies that were not peer reviewed, non-autoimmune disease comorbidity research and record types that were not primary research.
Results: 78 (of 409) records met the inclusion criteria. Random forest methods were most prevalent, and there was an increase in neural networks, mainly applied to imaging datasets. The main applications of ML to clinical tasks were diagnosis (18/78), disease course (22/78) and disease severity (16/78). The median sample size was 263. Clinical and microbiome-related datasets were most popular. 5% of studies used an external dataset after training and testing for additional model validation.
Discussion: Availability of longitudinal and deep phenotyping data could lead to better modelling. ML pipelines considering imbalanced data, and feature selection only on training data will generate more generalisable models. ML models are increasingly being applied to more complex clinical tasks for specific phenotypes, indicating progress towards personalised medicine for IBD.
Stafford, Imogen, Sian
50987dc1-3772-408f-9093-9124f3d6b2cd
Gosink, Mark M
68be1baa-5296-4987-9da5-4fa106202421
Mossotto, Enrico
a2a572db-3e95-41c6-94f6-f1b019594372
Ennis, Sarah
7b57f188-9d91-4beb-b217-09856146f1e9
Hauben, Manfred
588a4647-2188-45e1-ae46-e475520bdd58
Stafford, Imogen, Sian
50987dc1-3772-408f-9093-9124f3d6b2cd
Gosink, Mark M
68be1baa-5296-4987-9da5-4fa106202421
Mossotto, Enrico
a2a572db-3e95-41c6-94f6-f1b019594372
Ennis, Sarah
7b57f188-9d91-4beb-b217-09856146f1e9
Hauben, Manfred
588a4647-2188-45e1-ae46-e475520bdd58
Stafford, Imogen, Sian, Gosink, Mark M, Mossotto, Enrico, Ennis, Sarah and Hauben, Manfred
(2022)
A systematic review of artificial intelligence and machine learning applications to inflammatory bowel disease, with practical guidelines for interpretation.
Inflammatory Bowel Diseases, 28 (10).
(doi:10.1093/ibd/izac115).
(In Press)
Abstract
Background: Inflammatory bowel disease (IBD) is a gastrointestinal chronic disease with an unpredictable disease course. Computational methods such as machine learning (ML) have the potential to stratify IBD patients for the provision of individualised care. The use of ML methods for IBD was surveyed, with an additional focus on how the field has changed over time.
Methods: A systematic review was conducted through a search of MEDLINE and Embase databases, with the search structure (“machine learning” OR “artificial intelligence”) AND (“Crohn* Disease” OR “Ulcerative Colitis” OR “Inflammatory Bowel Disease”), searched 6th May 2021. Exclusion criteria: studies not written in English, no human patient data, publication before 2001, studies that were not peer reviewed, non-autoimmune disease comorbidity research and record types that were not primary research.
Results: 78 (of 409) records met the inclusion criteria. Random forest methods were most prevalent, and there was an increase in neural networks, mainly applied to imaging datasets. The main applications of ML to clinical tasks were diagnosis (18/78), disease course (22/78) and disease severity (16/78). The median sample size was 263. Clinical and microbiome-related datasets were most popular. 5% of studies used an external dataset after training and testing for additional model validation.
Discussion: Availability of longitudinal and deep phenotyping data could lead to better modelling. ML pipelines considering imbalanced data, and feature selection only on training data will generate more generalisable models. ML models are increasingly being applied to more complex clinical tasks for specific phenotypes, indicating progress towards personalised medicine for IBD.
Text
20220323_Systematic Review AI and IBD rev clean
- Accepted Manuscript
More information
Accepted/In Press date: 2 May 2022
Identifiers
Local EPrints ID: 457290
URI: http://eprints.soton.ac.uk/id/eprint/457290
ISSN: 1536-4844
PURE UUID: f2a4030d-b755-482c-a513-7a66c510140e
Catalogue record
Date deposited: 30 May 2022 17:06
Last modified: 18 Mar 2024 05:16
Export record
Altmetrics
Contributors
Author:
Imogen, Sian Stafford
Author:
Mark M Gosink
Author:
Enrico Mossotto
Author:
Manfred Hauben
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics