Using explainable machine learning to identify patients at risk of reattendance at discharge from emergency departments
Using explainable machine learning to identify patients at risk of reattendance at discharge from emergency departments
Short-term reattendances to emergency departments are a key quality of care indicator. Identifying patients at increased risk of early reattendance could help reduce the number of missed critical illnesses and could reduce avoidable utilization of emergency departments by enabling targeted post-discharge intervention. In this manuscript, we present a retrospective, single-centre study where we created and evaluated an extreme gradient boosting decision tree model trained to identify patients at risk of reattendance within 72 h of discharge from an emergency department (University Hospitals Southampton Foundation Trust, UK). Our model was trained using 35,447 attendances by 28,945 patients and evaluated on a hold-out test set featuring 8847 attendances by 7237 patients. The set of attendances from a given patient appeared exclusively in either the training or the test set. Our model was trained using both visit level variables (e.g., vital signs, arrival mode, and chief complaint) and a set of variables available in a patients electronic patient record, such as age and any recorded medical conditions. On the hold-out test set, our highest performing model obtained an AUROC of 0.747 (95% CI 0.722–0.773) and an average precision of 0.233 (95% CI 0.194–0.277). These results demonstrate that machine-learning models can be used to classify patients, with moderate performance, into low and high-risk groups for reattendance. We explained our models predictions using SHAP values, a concept developed from coalitional game theory, capable of explaining predictions at an attendance level. We demonstrated how clustering techniques (the UMAP algorithm) can be used to investigate the different sub-groups of explanations present in our patient cohort.
Chmiel, F. P.
2de259aa-a5eb-460c-bfbf-8b44ed02e2bd
Burns, D. K.
40b9dc88-a54a-4365-b747-4456d9203146
Azor, M.
f857a719-496d-4b79-b28a-17af196a95d8
Borca, F.
31fc3965-6bcf-4fd6-85bc-8b0f99f62473
Boniface, M. J.
f30bfd7d-20ed-451b-b405-34e3e22fdfba
Zlatev, Z. D.
8f2e3635-d76c-46e2-85b9-53cc223fee01
White, N. M.
c7be4c26-e419-4e5c-9420-09fc02e2ac9c
Daniels, T. W.V.
d635a2fb-96a1-46ec-8cdf-8eb44a4bd0f5
Kiuber, M.
8feb60c2-8682-49e2-b8b6-bb1fcac34d1c
2 November 2021
Chmiel, F. P.
2de259aa-a5eb-460c-bfbf-8b44ed02e2bd
Burns, D. K.
40b9dc88-a54a-4365-b747-4456d9203146
Azor, M.
f857a719-496d-4b79-b28a-17af196a95d8
Borca, F.
31fc3965-6bcf-4fd6-85bc-8b0f99f62473
Boniface, M. J.
f30bfd7d-20ed-451b-b405-34e3e22fdfba
Zlatev, Z. D.
8f2e3635-d76c-46e2-85b9-53cc223fee01
White, N. M.
c7be4c26-e419-4e5c-9420-09fc02e2ac9c
Daniels, T. W.V.
d635a2fb-96a1-46ec-8cdf-8eb44a4bd0f5
Kiuber, M.
8feb60c2-8682-49e2-b8b6-bb1fcac34d1c
Chmiel, F. P., Burns, D. K., Azor, M., Borca, F., Boniface, M. J., Zlatev, Z. D., White, N. M., Daniels, T. W.V. and Kiuber, M.
(2021)
Using explainable machine learning to identify patients at risk of reattendance at discharge from emergency departments.
Scientific Reports, 11 (1), [21513].
(doi:10.1038/s41598-021-00937-9).
Abstract
Short-term reattendances to emergency departments are a key quality of care indicator. Identifying patients at increased risk of early reattendance could help reduce the number of missed critical illnesses and could reduce avoidable utilization of emergency departments by enabling targeted post-discharge intervention. In this manuscript, we present a retrospective, single-centre study where we created and evaluated an extreme gradient boosting decision tree model trained to identify patients at risk of reattendance within 72 h of discharge from an emergency department (University Hospitals Southampton Foundation Trust, UK). Our model was trained using 35,447 attendances by 28,945 patients and evaluated on a hold-out test set featuring 8847 attendances by 7237 patients. The set of attendances from a given patient appeared exclusively in either the training or the test set. Our model was trained using both visit level variables (e.g., vital signs, arrival mode, and chief complaint) and a set of variables available in a patients electronic patient record, such as age and any recorded medical conditions. On the hold-out test set, our highest performing model obtained an AUROC of 0.747 (95% CI 0.722–0.773) and an average precision of 0.233 (95% CI 0.194–0.277). These results demonstrate that machine-learning models can be used to classify patients, with moderate performance, into low and high-risk groups for reattendance. We explained our models predictions using SHAP values, a concept developed from coalitional game theory, capable of explaining predictions at an attendance level. We demonstrated how clustering techniques (the UMAP algorithm) can be used to investigate the different sub-groups of explanations present in our patient cohort.
Text
s41598-021-00937-9
- Version of Record
More information
Accepted/In Press date: 20 October 2021
Published date: 2 November 2021
Additional Information:
Funding Information:
This work was supported by The Alan Turing Institute under the EPSRC Grant EP/N510129/1. We acknowledge support from the NIHR Wessex ARC.
Identifiers
Local EPrints ID: 453810
URI: http://eprints.soton.ac.uk/id/eprint/453810
ISSN: 2045-2322
PURE UUID: 029dffa6-d6fc-45cd-becd-1b93b0ce9dc1
Catalogue record
Date deposited: 24 Jan 2022 17:53
Last modified: 18 Mar 2024 03:40
Export record
Altmetrics
Contributors
Author:
F. P. Chmiel
Author:
D. K. Burns
Author:
M. Azor
Author:
F. Borca
Author:
Z. D. Zlatev
Author:
N. M. White
Author:
T. W.V. Daniels
Author:
M. Kiuber
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics