The University of Southampton
University of Southampton Institutional Repository

Explainable early prediction of gestational diabetes biomarkers by combining medical background and wearable devices: a pilot study with a cohort group in South Africa

Explainable early prediction of gestational diabetes biomarkers by combining medical background and wearable devices: a pilot study with a cohort group in South Africa
Explainable early prediction of gestational diabetes biomarkers by combining medical background and wearable devices: a pilot study with a cohort group in South Africa
This study aims to explore the potential of Internet of Things (IoT) devices and explainable Artificial Intelligence (AI) techniques in predicting biomarker values associated with GDM when measured 13–16 weeks prior to diagnosis. We developed a system that forecasts biomarkers such as LDL, HDL, triglycerides, cholesterol, HbA1c, and results from the Oral Glucose Tolerance Test (OGTT) including fasting glucose, 1-hour, and 2-hour post-load glucose values. These biomarker values are predicted based on sensory measurements collected around week 12 of pregnancy, including continuous glucose levels, short physical movement recordings, and medical background information. To the best of our knowledge, this is the first study to forecast GDM-associated biomarker values 13 to 16 weeks prior to the GDM screening test, using continuous glucose monitoring devices, a wristband for activity detection, and medical background data. We applied machine learning models, specifically Decision Tree and Random Forest Regressors, along with Coupled-Matrix Tensor Factorisation (CMTF) and Elastic Net techniques, examining all possible combinations of these methods across different data modalities. The results demonstrated good performance for most biomarkers. On average, the models achieved Mean Squared Error (MSE) between 0.29 and 0.42 and Mean Absolute Error (MAE) between 0.23 and 0.45 for biomarkers like HDL, LDL, cholesterol, and HbA1c. For the OGTT glucose values, the average MSE ranged from 0.95 to 2.44, and the average MAE ranged from 0.72 to 0.91. Additionally, the utilisation of CMTF with Alternating Least Squares technique yielded slightly better results (0.16 MSE and 0.07 MAE on average) compared to the well-known Elastic Net feature selection technique. While our study was conducted with a limited cohort in South Africa, our findings offer promising indications regarding the potential for predicting biomarker values in pregnant women through the integration of wearable devices and medical background data in the analysis. Nevertheless, further validation on a larger, more diverse cohort is imperative to substantiate these encouraging results.
coupled-matrix tensor factorisation, explainable AI models, gestational diabetes mellitus, Internet of Things healthcare, remote sensing, tree-based regressors
2168-2194
1860-1871
Kolozali, Şefki
23ec37c0-4c78-4dfe-ab34-191f5f2d9709
White, Sara L.
7c2d382b-89ff-4535-9f86-b542f0e7d20e
Norris, Shane
1d346f1b-6d5f-4bca-ac87-7589851b75a4
Fasli, Maria
0628512e-ac16-48a8-b679-b940f61dd45e
Heerden, Alastair van
9a8f1413-27ef-4a2b-867b-63bbcdfd3379
Kolozali, Şefki
23ec37c0-4c78-4dfe-ab34-191f5f2d9709
White, Sara L.
7c2d382b-89ff-4535-9f86-b542f0e7d20e
Norris, Shane
1d346f1b-6d5f-4bca-ac87-7589851b75a4
Fasli, Maria
0628512e-ac16-48a8-b679-b940f61dd45e
Heerden, Alastair van
9a8f1413-27ef-4a2b-867b-63bbcdfd3379

Kolozali, Şefki, White, Sara L., Norris, Shane, Fasli, Maria and Heerden, Alastair van (2024) Explainable early prediction of gestational diabetes biomarkers by combining medical background and wearable devices: a pilot study with a cohort group in South Africa. IEEE Journal of Biomedical and Health Informatics, 28 (4), 1860-1871. (doi:10.1109/JBHI.2024.3361505).

Record type: Article

Abstract

This study aims to explore the potential of Internet of Things (IoT) devices and explainable Artificial Intelligence (AI) techniques in predicting biomarker values associated with GDM when measured 13–16 weeks prior to diagnosis. We developed a system that forecasts biomarkers such as LDL, HDL, triglycerides, cholesterol, HbA1c, and results from the Oral Glucose Tolerance Test (OGTT) including fasting glucose, 1-hour, and 2-hour post-load glucose values. These biomarker values are predicted based on sensory measurements collected around week 12 of pregnancy, including continuous glucose levels, short physical movement recordings, and medical background information. To the best of our knowledge, this is the first study to forecast GDM-associated biomarker values 13 to 16 weeks prior to the GDM screening test, using continuous glucose monitoring devices, a wristband for activity detection, and medical background data. We applied machine learning models, specifically Decision Tree and Random Forest Regressors, along with Coupled-Matrix Tensor Factorisation (CMTF) and Elastic Net techniques, examining all possible combinations of these methods across different data modalities. The results demonstrated good performance for most biomarkers. On average, the models achieved Mean Squared Error (MSE) between 0.29 and 0.42 and Mean Absolute Error (MAE) between 0.23 and 0.45 for biomarkers like HDL, LDL, cholesterol, and HbA1c. For the OGTT glucose values, the average MSE ranged from 0.95 to 2.44, and the average MAE ranged from 0.72 to 0.91. Additionally, the utilisation of CMTF with Alternating Least Squares technique yielded slightly better results (0.16 MSE and 0.07 MAE on average) compared to the well-known Elastic Net feature selection technique. While our study was conducted with a limited cohort in South Africa, our findings offer promising indications regarding the potential for predicting biomarker values in pregnant women through the integration of wearable devices and medical background data in the analysis. Nevertheless, further validation on a larger, more diverse cohort is imperative to substantiate these encouraging results.

This record has no associated files available for download.

More information

Accepted/In Press date: 27 January 2024
Published date: 12 February 2024
Keywords: coupled-matrix tensor factorisation, explainable AI models, gestational diabetes mellitus, Internet of Things healthcare, remote sensing, tree-based regressors

Identifiers

Local EPrints ID: 496966
URI: http://eprints.soton.ac.uk/id/eprint/496966
ISSN: 2168-2194
PURE UUID: 4acade44-4811-414b-ba03-8fdfac0d60e9
ORCID for Shane Norris: ORCID iD orcid.org/0000-0001-7124-3788

Catalogue record

Date deposited: 08 Jan 2025 15:38
Last modified: 10 Jan 2025 03:05

Export record

Altmetrics

Contributors

Author: Şefki Kolozali
Author: Sara L. White
Author: Shane Norris ORCID iD
Author: Maria Fasli
Author: Alastair van Heerden

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×