Anticipating type 1 diabetes in children using machine learning and routinely-collected data in primary care
Anticipating type 1 diabetes in children using machine learning and routinely-collected data in primary care
Oral abstracts:Background and aim: Delayed or misdiagnosis are significant risk factors for children presenting in diabetic ketoacidosis (DKA) at diagnosis. Symptoms may be attributed to more common conditions, making diagnosis challenging. We aimed to develop and validate a predictive model to identify type 1 diabetes based on coded data within the primary care electronic medical record.
Methods: To develop the algorithm we used routinely collected primary care data (Secured Anonymised Information Linkage (SAIL) databank), linked with secondary care data (Brecon Register), in Welsh children <15 years, within 01/01/00 to 31/12/16. Codes relating to primary care contacts were grouped into 25 pre-determined categories. We used machine learning (SuperLearner) to train the predictive algorithm, and tested it using Clinical Practice Research Datalink (CPRD) (English primary care data) linked with Hospital Episode Statistics (type 1 diabetes diagnosis) data.
Results: The algorithm was trained on data from 1.2 million children, with 35 million general practitioner (GP) contacts (SAIL), 2051 (0.21%) diagnosed with type 1 diabetes (DKA 26%) (Brecon). The test data from CPRD included 1.5 million children and 43 million GP contacts. 1516 (0.10%) were determined newly diagnosed (DKA 26%). Using a definition of effort-benefit trade-off (benefit = patients diagnosed, effort = GP contacts) the estimated AUC was 0.906. Using a 10% effort threshold, 75% of children with type 1 diabetes were identified by the algorithm 90 days prior to diagnosis. 25% of children would have their diagnosis anticipated by at least a week
Townson, J.
bbbf583d-fd5e-4750-b3dc-02b7f2dac204
Daniel, R.
f9090db9-bf99-4fb5-a96c-aadfad9814b5
Jones, H.
72e80555-02a1-41fe-9a6b-0d7d8b3cd72a
Francis, N. A.
9b610883-605c-4fee-871d-defaa86ccf8e
Paranjothy, S.
04acae3d-1dba-48ee-80e4-6f4b85cb8043
Baldwin, B.
6681a526-5832-4a90-bfd9-73df89c76b75
Thayer, D.
c4dec024-450c-4cf0-8ced-eb20a1605a0e
Gregory, J. W.
8008613e-992f-408e-b5dc-ecfeae0c2b94
Townson, J.
bbbf583d-fd5e-4750-b3dc-02b7f2dac204
Daniel, R.
f9090db9-bf99-4fb5-a96c-aadfad9814b5
Jones, H.
72e80555-02a1-41fe-9a6b-0d7d8b3cd72a
Francis, N. A.
9b610883-605c-4fee-871d-defaa86ccf8e
Paranjothy, S.
04acae3d-1dba-48ee-80e4-6f4b85cb8043
Baldwin, B.
6681a526-5832-4a90-bfd9-73df89c76b75
Thayer, D.
c4dec024-450c-4cf0-8ced-eb20a1605a0e
Gregory, J. W.
8008613e-992f-408e-b5dc-ecfeae0c2b94
Townson, J., Daniel, R., Jones, H., Francis, N. A., Paranjothy, S., Baldwin, B., Thayer, D. and Gregory, J. W.
(2022)
Anticipating type 1 diabetes in children using machine learning and routinely-collected data in primary care.
Diabetic Medicine, 39 (S1), [e14809].
(doi:10.1111/dme.14809).
Record type:
Meeting abstract
Abstract
Oral abstracts:Background and aim: Delayed or misdiagnosis are significant risk factors for children presenting in diabetic ketoacidosis (DKA) at diagnosis. Symptoms may be attributed to more common conditions, making diagnosis challenging. We aimed to develop and validate a predictive model to identify type 1 diabetes based on coded data within the primary care electronic medical record.
Methods: To develop the algorithm we used routinely collected primary care data (Secured Anonymised Information Linkage (SAIL) databank), linked with secondary care data (Brecon Register), in Welsh children <15 years, within 01/01/00 to 31/12/16. Codes relating to primary care contacts were grouped into 25 pre-determined categories. We used machine learning (SuperLearner) to train the predictive algorithm, and tested it using Clinical Practice Research Datalink (CPRD) (English primary care data) linked with Hospital Episode Statistics (type 1 diabetes diagnosis) data.
Results: The algorithm was trained on data from 1.2 million children, with 35 million general practitioner (GP) contacts (SAIL), 2051 (0.21%) diagnosed with type 1 diabetes (DKA 26%) (Brecon). The test data from CPRD included 1.5 million children and 43 million GP contacts. 1516 (0.10%) were determined newly diagnosed (DKA 26%). Using a definition of effort-benefit trade-off (benefit = patients diagnosed, effort = GP contacts) the estimated AUC was 0.906. Using a 10% effort threshold, 75% of children with type 1 diabetes were identified by the algorithm 90 days prior to diagnosis. 25% of children would have their diagnosis anticipated by at least a week
This record has no associated files available for download.
More information
e-pub ahead of print date: 28 March 2022
Venue - Dates:
Diabetes UK Professional Conference 2022, QEII Centre, London, United Kingdom, 2022-03-29 - 2022-04-01
Identifiers
Local EPrints ID: 474015
URI: http://eprints.soton.ac.uk/id/eprint/474015
ISSN: 0742-3071
PURE UUID: 1ee3e1d6-621c-4acc-ad49-243fbe4157cf
Catalogue record
Date deposited: 08 Feb 2023 18:02
Last modified: 17 Mar 2024 03:58
Export record
Altmetrics
Contributors
Author:
J. Townson
Author:
R. Daniel
Author:
H. Jones
Author:
S. Paranjothy
Author:
B. Baldwin
Author:
D. Thayer
Author:
J. W. Gregory
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics