Predicting type 1 diabetes in children using electronic health records in primary care in the UK: development and validation of a machine-learning algorithm
Predicting type 1 diabetes in children using electronic health records in primary care in the UK: development and validation of a machine-learning algorithm
Background: children presenting to primary care with suspected type 1 diabetes should be referred immediately to secondary care to avoid life-threatening diabetic ketoacidosis. However, early recognition of children with type 1 diabetes is challenging. Children might not present with classic symptoms, or symptoms might be attributed to more common conditions. A quarter of children present with diabetic ketoacidosis, a proportion unchanged over 25 years. Our aim was to investigate whether a machine-learning algorithm could lead to earlier detection of type 1 diabetes in primary care.
Methods: we developed the predictive algorithm using Welsh primary care electronic health records (EHRs) linked to the Brecon Dataset, a register of children newly diagnosed with type 1 diabetes. Children were included from their first primary care record within the study period of Jan 1, 2000, to Dec 31, 2016, until either type 1 diabetes diagnosis, they turned 15 years of age, or study end. We developed an ensemble learner (SuperLearner) using 26 potential predictors. Validation of the algorithm was done in English EHRs from the Clinical Practice Research Datalink (primary care) and Hospital Episode Statistics, focusing on the ability of the algorithm to identify children who went on to develop type 1 diabetes and the time by which diagnosis could be anticipated.
Findings: the development dataset comprised 34 754 400 primary care contacts, relating to 952 402 children, and the validation dataset comprised 43 089 103 primary care contacts, relating to 1 493 328 children. Of these, 1829 (0·19%) children younger than 15 years in the development dataset, and 1516 (0·10%) in the validation dataset had a reliable date of type 1 diabetes diagnosis. If set to give an alert in 10% of contacts, an estimated 71·6% (95% CI 68·8–74·4) of the children with type 1 diabetes would receive an alert by the algorithm in the 90 days before diagnosis, with diagnosis anticipated, on average, by an estimated 9·34 days (95% CI 7·77–10·9).
Interpretation: if implemented into primary care settings, this predictive algorithm could substantially reduce the proportion of patients with new-onset type 1 diabetes presenting in diabetic ketoacidosis. Acceptability of alert thresholds should be explored in primary care.
Funding: Diabetes UK.
Adolescent, Algorithms, Child, Child, Preschool, Diabetes Mellitus, Type 1/diagnosis, Diabetic Ketoacidosis/diagnosis, Electronic Health Records, Female, Humans, Infant, Machine Learning, Male, Primary Health Care, United Kingdom
e386-e395
Daniel, Rhian
f9090db9-bf99-4fb5-a96c-aadfad9814b5
Jones, Hywel
08149e0a-2674-423f-ba25-d16303fbe3cd
Gregory, John W.
745ba9d8-0ece-488e-8c3f-6d6f9b8bf99d
Shetty, Ambika
9a0ea4f0-8c15-4c8c-ad92-c864eb320491
Francis, Nick
9b610883-605c-4fee-871d-defaa86ccf8e
Paranjothy, Shantini
42cb0c5b-8791-4226-aba3-97a7ace1a20c
Townson, Julia
bbbf583d-fd5e-4750-b3dc-02b7f2dac204
22 May 2024
Daniel, Rhian
f9090db9-bf99-4fb5-a96c-aadfad9814b5
Jones, Hywel
08149e0a-2674-423f-ba25-d16303fbe3cd
Gregory, John W.
745ba9d8-0ece-488e-8c3f-6d6f9b8bf99d
Shetty, Ambika
9a0ea4f0-8c15-4c8c-ad92-c864eb320491
Francis, Nick
9b610883-605c-4fee-871d-defaa86ccf8e
Paranjothy, Shantini
42cb0c5b-8791-4226-aba3-97a7ace1a20c
Townson, Julia
bbbf583d-fd5e-4750-b3dc-02b7f2dac204
Daniel, Rhian, Jones, Hywel, Gregory, John W., Shetty, Ambika, Francis, Nick, Paranjothy, Shantini and Townson, Julia
(2024)
Predicting type 1 diabetes in children using electronic health records in primary care in the UK: development and validation of a machine-learning algorithm.
The Lancet Digital Health, 6 (6), .
(doi:10.1016/S2589-7500(24)00050-5).
Abstract
Background: children presenting to primary care with suspected type 1 diabetes should be referred immediately to secondary care to avoid life-threatening diabetic ketoacidosis. However, early recognition of children with type 1 diabetes is challenging. Children might not present with classic symptoms, or symptoms might be attributed to more common conditions. A quarter of children present with diabetic ketoacidosis, a proportion unchanged over 25 years. Our aim was to investigate whether a machine-learning algorithm could lead to earlier detection of type 1 diabetes in primary care.
Methods: we developed the predictive algorithm using Welsh primary care electronic health records (EHRs) linked to the Brecon Dataset, a register of children newly diagnosed with type 1 diabetes. Children were included from their first primary care record within the study period of Jan 1, 2000, to Dec 31, 2016, until either type 1 diabetes diagnosis, they turned 15 years of age, or study end. We developed an ensemble learner (SuperLearner) using 26 potential predictors. Validation of the algorithm was done in English EHRs from the Clinical Practice Research Datalink (primary care) and Hospital Episode Statistics, focusing on the ability of the algorithm to identify children who went on to develop type 1 diabetes and the time by which diagnosis could be anticipated.
Findings: the development dataset comprised 34 754 400 primary care contacts, relating to 952 402 children, and the validation dataset comprised 43 089 103 primary care contacts, relating to 1 493 328 children. Of these, 1829 (0·19%) children younger than 15 years in the development dataset, and 1516 (0·10%) in the validation dataset had a reliable date of type 1 diabetes diagnosis. If set to give an alert in 10% of contacts, an estimated 71·6% (95% CI 68·8–74·4) of the children with type 1 diabetes would receive an alert by the algorithm in the 90 days before diagnosis, with diagnosis anticipated, on average, by an estimated 9·34 days (95% CI 7·77–10·9).
Interpretation: if implemented into primary care settings, this predictive algorithm could substantially reduce the proportion of patients with new-onset type 1 diabetes presenting in diabetic ketoacidosis. Acceptability of alert thresholds should be explored in primary care.
Funding: Diabetes UK.
Text
1-s2.0-S2589750024000505-main
- Version of Record
More information
e-pub ahead of print date: 22 May 2024
Published date: 22 May 2024
Keywords:
Adolescent, Algorithms, Child, Child, Preschool, Diabetes Mellitus, Type 1/diagnosis, Diabetic Ketoacidosis/diagnosis, Electronic Health Records, Female, Humans, Infant, Machine Learning, Male, Primary Health Care, United Kingdom
Identifiers
Local EPrints ID: 490875
URI: http://eprints.soton.ac.uk/id/eprint/490875
ISSN: 2589-7500
PURE UUID: bbb08267-e04a-4ce6-aad9-def7555fbc90
Catalogue record
Date deposited: 07 Jun 2024 16:43
Last modified: 08 Jun 2024 01:59
Export record
Altmetrics
Contributors
Author:
Rhian Daniel
Author:
Hywel Jones
Author:
John W. Gregory
Author:
Ambika Shetty
Author:
Shantini Paranjothy
Author:
Julia Townson
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics