Evaluating listening performance for COVID-19 detection by clinicians and machine learning: comparative study
Evaluating listening performance for COVID-19 detection by clinicians and machine learning: comparative study
Background: to date, performance comparisons between men and machines have been carried out in many health domains. Yet machine learning (ML) models and human performance comparisons in audio-based respiratory diagnosis remain largely unexplored.
Objective: the primary objective of this study was to compare human clinicians and an ML model in predicting COVID-19 from respiratory sound recordings.
Methods: in this study, we compared human clinicians and an ML model in predicting COVID-19 from respiratory sound recordings. Prediction performance on 24 audio samples (12 tested positive) made by 36 clinicians with experience in treating COVID-19 or other respiratory illnesses was compared with predictions made by an ML model trained on 1162 samples. Each sample consisted of voice, cough, and breathing sound recordings from 1 subject, and the length of each sample was around 20 seconds. We also investigated whether combining the predictions of the model and human experts could further enhance the performance in terms of both accuracy and confidence.
Results: the ML model outperformed the clinicians, yielding a sensitivity of 0.75 and a specificity of 0.83, whereas the best performance achieved by the clinicians was 0.67 in terms of sensitivity and 0.75 in terms of specificity. Integrating the clinicians’ and the model’s predictions, however, could enhance performance further, achieving a sensitivity of 0.83 and a specificity of 0.92.
Conclusions: our findings suggest that the clinicians and the ML model could make better clinical decisions via a cooperative approach and achieve higher confidence in audio-based respiratory diagnosis.
Han, Jing
0e18bcab-5434-4606-b635-70fb07250322
Montagna, Marco
1e04949f-87c3-4ca4-a772-5d59a9407f07
Grammenos, Andreas
a0e7ff5f-2149-4aab-b3e3-6733d7290659
Xia, Tong
763cc71a-eaf1-4902-a27d-62cd5912a569
Bondareva, Erika
7126e0d3-4e24-4e59-90bc-d56aaa8a639c
Siegele-Brown, Chloë
b65ca127-9b66-4a6e-9e3b-95c23755732d
Chauhan, Jagmohan
831a12dc-6df9-40ea-8bb3-2c5da8882804
Dang, Ting
1ef75748-8531-43ef-9edd-9553d0899940
Spathis, Dimitris
0352ccc2-a97c-4273-998a-0bc8546612fa
Floto, R. Andres
79ab6e97-aee4-441c-81ba-8d829c057043
Cicuta, Pietro
69dd83a4-3a88-4801-919f-d7015910b11a
Mascolo, Cecilia
fbee25b9-943f-4759-8ca3-ae228af3cab0
9 May 2023
Han, Jing
0e18bcab-5434-4606-b635-70fb07250322
Montagna, Marco
1e04949f-87c3-4ca4-a772-5d59a9407f07
Grammenos, Andreas
a0e7ff5f-2149-4aab-b3e3-6733d7290659
Xia, Tong
763cc71a-eaf1-4902-a27d-62cd5912a569
Bondareva, Erika
7126e0d3-4e24-4e59-90bc-d56aaa8a639c
Siegele-Brown, Chloë
b65ca127-9b66-4a6e-9e3b-95c23755732d
Chauhan, Jagmohan
831a12dc-6df9-40ea-8bb3-2c5da8882804
Dang, Ting
1ef75748-8531-43ef-9edd-9553d0899940
Spathis, Dimitris
0352ccc2-a97c-4273-998a-0bc8546612fa
Floto, R. Andres
79ab6e97-aee4-441c-81ba-8d829c057043
Cicuta, Pietro
69dd83a4-3a88-4801-919f-d7015910b11a
Mascolo, Cecilia
fbee25b9-943f-4759-8ca3-ae228af3cab0
Han, Jing, Montagna, Marco, Grammenos, Andreas, Xia, Tong, Bondareva, Erika, Siegele-Brown, Chloë, Chauhan, Jagmohan, Dang, Ting, Spathis, Dimitris, Floto, R. Andres, Cicuta, Pietro and Mascolo, Cecilia
(2023)
Evaluating listening performance for COVID-19 detection by clinicians and machine learning: comparative study.
Journal of Medical Internet Research, 25, [e44804].
(doi:10.2196/44804).
Abstract
Background: to date, performance comparisons between men and machines have been carried out in many health domains. Yet machine learning (ML) models and human performance comparisons in audio-based respiratory diagnosis remain largely unexplored.
Objective: the primary objective of this study was to compare human clinicians and an ML model in predicting COVID-19 from respiratory sound recordings.
Methods: in this study, we compared human clinicians and an ML model in predicting COVID-19 from respiratory sound recordings. Prediction performance on 24 audio samples (12 tested positive) made by 36 clinicians with experience in treating COVID-19 or other respiratory illnesses was compared with predictions made by an ML model trained on 1162 samples. Each sample consisted of voice, cough, and breathing sound recordings from 1 subject, and the length of each sample was around 20 seconds. We also investigated whether combining the predictions of the model and human experts could further enhance the performance in terms of both accuracy and confidence.
Results: the ML model outperformed the clinicians, yielding a sensitivity of 0.75 and a specificity of 0.83, whereas the best performance achieved by the clinicians was 0.67 in terms of sensitivity and 0.75 in terms of specificity. Integrating the clinicians’ and the model’s predictions, however, could enhance performance further, achieving a sensitivity of 0.83 and a specificity of 0.92.
Conclusions: our findings suggest that the clinicians and the ML model could make better clinical decisions via a cooperative approach and achieve higher confidence in audio-based respiratory diagnosis.
Text
jmir-2023-1-e44804
- Version of Record
More information
Accepted/In Press date: 28 April 2023
Published date: 9 May 2023
Identifiers
Local EPrints ID: 491171
URI: http://eprints.soton.ac.uk/id/eprint/491171
ISSN: 1438-8871
PURE UUID: 3d33c7f9-8220-4a84-9871-28e558ed21fc
Catalogue record
Date deposited: 13 Jun 2024 17:13
Last modified: 14 Jun 2024 17:22
Export record
Altmetrics
Contributors
Author:
Jing Han
Author:
Marco Montagna
Author:
Andreas Grammenos
Author:
Tong Xia
Author:
Erika Bondareva
Author:
Chloë Siegele-Brown
Author:
Jagmohan Chauhan
Author:
Ting Dang
Author:
Dimitris Spathis
Author:
R. Andres Floto
Author:
Pietro Cicuta
Author:
Cecilia Mascolo
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics