Explainable early prediction of gestational diabetes biomarkers by combining medical background and wearable devices: a pilot study with a cohort group in South Africa
Explainable early prediction of gestational diabetes biomarkers by combining medical background and wearable devices: a pilot study with a cohort group in South Africa
This study aims to explore the potential of Internet of Things (IoT) devices and explainable Artificial Intelligence (AI) techniques in predicting biomarker values associated with GDM when measured 13–16 weeks prior to diagnosis. We developed a system that forecasts biomarkers such as LDL, HDL, triglycerides, cholesterol, HbA1c, and results from the Oral Glucose Tolerance Test (OGTT) including fasting glucose, 1-hour, and 2-hour post-load glucose values. These biomarker values are predicted based on sensory measurements collected around week 12 of pregnancy, including continuous glucose levels, short physical movement recordings, and medical background information. To the best of our knowledge, this is the first study to forecast GDM-associated biomarker values 13 to 16 weeks prior to the GDM screening test, using continuous glucose monitoring devices, a wristband for activity detection, and medical background data. We applied machine learning models, specifically Decision Tree and Random Forest Regressors, along with Coupled-Matrix Tensor Factorisation (CMTF) and Elastic Net techniques, examining all possible combinations of these methods across different data modalities. The results demonstrated good performance for most biomarkers. On average, the models achieved Mean Squared Error (MSE) between 0.29 and 0.42 and Mean Absolute Error (MAE) between 0.23 and 0.45 for biomarkers like HDL, LDL, cholesterol, and HbA1c. For the OGTT glucose values, the average MSE ranged from 0.95 to 2.44, and the average MAE ranged from 0.72 to 0.91. Additionally, the utilisation of CMTF with Alternating Least Squares technique yielded slightly better results (0.16 MSE and 0.07 MAE on average) compared to the well-known Elastic Net feature selection technique. While our study was conducted with a limited cohort in South Africa, our findings offer promising indications regarding the potential for predicting biomarker values in pregnant women through the integration of wearable devices and medical background data in the analysis. Nevertheless, further validation on a larger, more diverse cohort is imperative to substantiate these encouraging results.
coupled-matrix tensor factorisation, explainable AI models, gestational diabetes mellitus, Internet of Things healthcare, remote sensing, tree-based regressors
1860-1871
Kolozali, Şefki
23ec37c0-4c78-4dfe-ab34-191f5f2d9709
White, Sara L.
7c2d382b-89ff-4535-9f86-b542f0e7d20e
Norris, Shane
1d346f1b-6d5f-4bca-ac87-7589851b75a4
Fasli, Maria
0628512e-ac16-48a8-b679-b940f61dd45e
Heerden, Alastair van
9a8f1413-27ef-4a2b-867b-63bbcdfd3379
12 February 2024
Kolozali, Şefki
23ec37c0-4c78-4dfe-ab34-191f5f2d9709
White, Sara L.
7c2d382b-89ff-4535-9f86-b542f0e7d20e
Norris, Shane
1d346f1b-6d5f-4bca-ac87-7589851b75a4
Fasli, Maria
0628512e-ac16-48a8-b679-b940f61dd45e
Heerden, Alastair van
9a8f1413-27ef-4a2b-867b-63bbcdfd3379
Kolozali, Şefki, White, Sara L., Norris, Shane, Fasli, Maria and Heerden, Alastair van
(2024)
Explainable early prediction of gestational diabetes biomarkers by combining medical background and wearable devices: a pilot study with a cohort group in South Africa.
IEEE Journal of Biomedical and Health Informatics, 28 (4), .
(doi:10.1109/JBHI.2024.3361505).
Abstract
This study aims to explore the potential of Internet of Things (IoT) devices and explainable Artificial Intelligence (AI) techniques in predicting biomarker values associated with GDM when measured 13–16 weeks prior to diagnosis. We developed a system that forecasts biomarkers such as LDL, HDL, triglycerides, cholesterol, HbA1c, and results from the Oral Glucose Tolerance Test (OGTT) including fasting glucose, 1-hour, and 2-hour post-load glucose values. These biomarker values are predicted based on sensory measurements collected around week 12 of pregnancy, including continuous glucose levels, short physical movement recordings, and medical background information. To the best of our knowledge, this is the first study to forecast GDM-associated biomarker values 13 to 16 weeks prior to the GDM screening test, using continuous glucose monitoring devices, a wristband for activity detection, and medical background data. We applied machine learning models, specifically Decision Tree and Random Forest Regressors, along with Coupled-Matrix Tensor Factorisation (CMTF) and Elastic Net techniques, examining all possible combinations of these methods across different data modalities. The results demonstrated good performance for most biomarkers. On average, the models achieved Mean Squared Error (MSE) between 0.29 and 0.42 and Mean Absolute Error (MAE) between 0.23 and 0.45 for biomarkers like HDL, LDL, cholesterol, and HbA1c. For the OGTT glucose values, the average MSE ranged from 0.95 to 2.44, and the average MAE ranged from 0.72 to 0.91. Additionally, the utilisation of CMTF with Alternating Least Squares technique yielded slightly better results (0.16 MSE and 0.07 MAE on average) compared to the well-known Elastic Net feature selection technique. While our study was conducted with a limited cohort in South Africa, our findings offer promising indications regarding the potential for predicting biomarker values in pregnant women through the integration of wearable devices and medical background data in the analysis. Nevertheless, further validation on a larger, more diverse cohort is imperative to substantiate these encouraging results.
This record has no associated files available for download.
More information
Accepted/In Press date: 27 January 2024
Published date: 12 February 2024
Keywords:
coupled-matrix tensor factorisation, explainable AI models, gestational diabetes mellitus, Internet of Things healthcare, remote sensing, tree-based regressors
Identifiers
Local EPrints ID: 496966
URI: http://eprints.soton.ac.uk/id/eprint/496966
ISSN: 2168-2194
PURE UUID: 4acade44-4811-414b-ba03-8fdfac0d60e9
Catalogue record
Date deposited: 08 Jan 2025 15:38
Last modified: 10 Jan 2025 03:05
Export record
Altmetrics
Contributors
Author:
Şefki Kolozali
Author:
Sara L. White
Author:
Maria Fasli
Author:
Alastair van Heerden
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics