Exploring longitudinal cough, breath, and voice data for COVID-19 progression prediction via sequential deep learning: model development and validation
Exploring longitudinal cough, breath, and voice data for COVID-19 progression prediction via sequential deep learning: model development and validation
Background: recent work has shown the potential of using audio data (eg, cough, breathing, and voice) in the screening for COVID-19. However, these approaches only focus on one-off detection and detect the infection, given the current audio sample, but do not monitor disease progression in COVID-19. Limited exploration has been put forward to continuously monitor COVID-19 progression, especially recovery, through longitudinal audio data. Tracking disease progression characteristics and patterns of recovery could bring insights and lead to more timely treatment or treatment adjustment, as well as better resource management in health care systems.
Objective: the primary objective of this study is to explore the potential of longitudinal audio samples over time for COVID-19 progression prediction and, especially, recovery trend prediction using sequential deep learning techniques.
Methods: crowdsourced respiratory audio data, including breathing, cough, and voice samples, from 212 individuals over 5-385 days were analyzed, alongside their self-reported COVID-19 test results. We developed and validated a deep learning–enabled tracking tool using gated recurrent units (GRUs) to detect COVID-19 progression by exploring the audio dynamics of the individuals’ historical audio biomarkers. The investigation comprised 2 parts: (1) COVID-19 detection in terms of positive and negative (healthy) tests using sequential audio signals, which was primarily assessed in terms of the area under the receiver operating characteristic curve (AUROC), sensitivity, and specificity, with 95% CIs, and (2) longitudinal disease progression prediction over time in terms of probability of positive tests, which was evaluated using the correlation between the predicted probability trajectory and self-reported labels.
Results: we first explored the benefits of capturing longitudinal dynamics of audio biomarkers for COVID-19 detection. The strong performance, yielding an AUROC of 0.79, a sensitivity of 0.75, and a specificity of 0.71 supported the effectiveness of the approach compared to methods that do not leverage longitudinal dynamics. We further examined the predicted disease progression trajectory, which displayed high consistency with longitudinal test results with a correlation of 0.75 in the test cohort and 0.86 in a subset of the test cohort with 12 (57.1%) of 21 COVID-19–positive participants who reported disease recovery. Our findings suggest that monitoring COVID-19 evolution via longitudinal audio data has potential in the tracking of individuals’ disease progression and recovery.
Conclusions: an audio-based COVID-19 progression monitoring system was developed using deep learning techniques, with strong performance showing high consistency between the predicted trajectory and the test results over time, especially for recovery trend predictions. This has good potential in the postpeak and postpandemic era that can help guide medical treatment and optimize hospital resource allocations. The changes in longitudinal audio samples, referred to as audio dynamics, are associated with COVID-19 progression; thus, modeling the audio dynamics can potentially capture the underlying disease progression process and further aid COVID-19 progression prediction. This framework provides a flexible, affordable, and timely tool for COVID-19 tracking, and more importantly, it also provides a proof of concept of how telemonitoring could be applicable to respiratory diseases monitoring, in general.
audio, COVID-19, COVID-19 progression, deep learning, longitudinal study, mobile health
Dang, Ting
1ef75748-8531-43ef-9edd-9553d0899940
Han, Jing
0e18bcab-5434-4606-b635-70fb07250322
Xia, Tong
763cc71a-eaf1-4902-a27d-62cd5912a569
Spathis, Dimitris
0352ccc2-a97c-4273-998a-0bc8546612fa
Bondareva, Erika
7126e0d3-4e24-4e59-90bc-d56aaa8a639c
Siegele-Brown, Chloë
b65ca127-9b66-4a6e-9e3b-95c23755732d
Chauhan, Jagmohan
831a12dc-6df9-40ea-8bb3-2c5da8882804
Grammenos, Andreas
a0e7ff5f-2149-4aab-b3e3-6733d7290659
Hasthanasombat, Apinan
94b01385-5bf6-455d-a1f9-06a9c3149ba5
Floto, R. Andres
79ab6e97-aee4-441c-81ba-8d829c057043
Cicuta, Pietro
80bc9499-6c6a-4d7a-8d45-b5b3b76a4695
Mascolo, Cecilia
e4a7bcf7-72c8-43b7-b6b3-4f8980da245d
21 June 2022
Dang, Ting
1ef75748-8531-43ef-9edd-9553d0899940
Han, Jing
0e18bcab-5434-4606-b635-70fb07250322
Xia, Tong
763cc71a-eaf1-4902-a27d-62cd5912a569
Spathis, Dimitris
0352ccc2-a97c-4273-998a-0bc8546612fa
Bondareva, Erika
7126e0d3-4e24-4e59-90bc-d56aaa8a639c
Siegele-Brown, Chloë
b65ca127-9b66-4a6e-9e3b-95c23755732d
Chauhan, Jagmohan
831a12dc-6df9-40ea-8bb3-2c5da8882804
Grammenos, Andreas
a0e7ff5f-2149-4aab-b3e3-6733d7290659
Hasthanasombat, Apinan
94b01385-5bf6-455d-a1f9-06a9c3149ba5
Floto, R. Andres
79ab6e97-aee4-441c-81ba-8d829c057043
Cicuta, Pietro
80bc9499-6c6a-4d7a-8d45-b5b3b76a4695
Mascolo, Cecilia
e4a7bcf7-72c8-43b7-b6b3-4f8980da245d
Dang, Ting, Han, Jing, Xia, Tong, Spathis, Dimitris, Bondareva, Erika, Siegele-Brown, Chloë, Chauhan, Jagmohan, Grammenos, Andreas, Hasthanasombat, Apinan, Floto, R. Andres, Cicuta, Pietro and Mascolo, Cecilia
(2022)
Exploring longitudinal cough, breath, and voice data for COVID-19 progression prediction via sequential deep learning: model development and validation.
Journal of Medical Internet Research, 24 (6), [e37004].
(doi:10.2196/37004).
Abstract
Background: recent work has shown the potential of using audio data (eg, cough, breathing, and voice) in the screening for COVID-19. However, these approaches only focus on one-off detection and detect the infection, given the current audio sample, but do not monitor disease progression in COVID-19. Limited exploration has been put forward to continuously monitor COVID-19 progression, especially recovery, through longitudinal audio data. Tracking disease progression characteristics and patterns of recovery could bring insights and lead to more timely treatment or treatment adjustment, as well as better resource management in health care systems.
Objective: the primary objective of this study is to explore the potential of longitudinal audio samples over time for COVID-19 progression prediction and, especially, recovery trend prediction using sequential deep learning techniques.
Methods: crowdsourced respiratory audio data, including breathing, cough, and voice samples, from 212 individuals over 5-385 days were analyzed, alongside their self-reported COVID-19 test results. We developed and validated a deep learning–enabled tracking tool using gated recurrent units (GRUs) to detect COVID-19 progression by exploring the audio dynamics of the individuals’ historical audio biomarkers. The investigation comprised 2 parts: (1) COVID-19 detection in terms of positive and negative (healthy) tests using sequential audio signals, which was primarily assessed in terms of the area under the receiver operating characteristic curve (AUROC), sensitivity, and specificity, with 95% CIs, and (2) longitudinal disease progression prediction over time in terms of probability of positive tests, which was evaluated using the correlation between the predicted probability trajectory and self-reported labels.
Results: we first explored the benefits of capturing longitudinal dynamics of audio biomarkers for COVID-19 detection. The strong performance, yielding an AUROC of 0.79, a sensitivity of 0.75, and a specificity of 0.71 supported the effectiveness of the approach compared to methods that do not leverage longitudinal dynamics. We further examined the predicted disease progression trajectory, which displayed high consistency with longitudinal test results with a correlation of 0.75 in the test cohort and 0.86 in a subset of the test cohort with 12 (57.1%) of 21 COVID-19–positive participants who reported disease recovery. Our findings suggest that monitoring COVID-19 evolution via longitudinal audio data has potential in the tracking of individuals’ disease progression and recovery.
Conclusions: an audio-based COVID-19 progression monitoring system was developed using deep learning techniques, with strong performance showing high consistency between the predicted trajectory and the test results over time, especially for recovery trend predictions. This has good potential in the postpeak and postpandemic era that can help guide medical treatment and optimize hospital resource allocations. The changes in longitudinal audio samples, referred to as audio dynamics, are associated with COVID-19 progression; thus, modeling the audio dynamics can potentially capture the underlying disease progression process and further aid COVID-19 progression prediction. This framework provides a flexible, affordable, and timely tool for COVID-19 tracking, and more importantly, it also provides a proof of concept of how telemonitoring could be applicable to respiratory diseases monitoring, in general.
Text
jmir-2022-6-e37004
- Version of Record
More information
Published date: 21 June 2022
Keywords:
audio, COVID-19, COVID-19 progression, deep learning, longitudinal study, mobile health
Identifiers
Local EPrints ID: 491127
URI: http://eprints.soton.ac.uk/id/eprint/491127
ISSN: 1438-8871
PURE UUID: 3d73d27e-cd6d-4c5a-a1a8-012cccdbc2b8
Catalogue record
Date deposited: 13 Jun 2024 16:37
Last modified: 13 Jun 2024 16:38
Export record
Altmetrics
Contributors
Author:
Ting Dang
Author:
Jing Han
Author:
Tong Xia
Author:
Dimitris Spathis
Author:
Erika Bondareva
Author:
Chloë Siegele-Brown
Author:
Jagmohan Chauhan
Author:
Andreas Grammenos
Author:
Apinan Hasthanasombat
Author:
R. Andres Floto
Author:
Pietro Cicuta
Author:
Cecilia Mascolo
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics