The University of Southampton
University of Southampton Institutional Repository

Flattening the curve through reinforcement learning driven test and trace policies

Flattening the curve through reinforcement learning driven test and trace policies
Flattening the curve through reinforcement learning driven test and trace policies
An effective way of limiting the diffusion of viruses when vaccines are unavailable or insufficiently potent to eradicate them is through running widespread “test and trace” programmes. Although these have been instrumental during the COVID-19 pandemic, they also lead to significant increases in public spending and societal disruptions caused by the numerous isolation requirements. What is more, after the health measures were relaxed across the world, these programmes were unable to prevent substantial upsurges in infections. Here we propose an alternative approach to conducting pathogen testing and contact tracing that is adaptable to the budgeting requirements and risk tolerances of regional policy makers, while still breaking the high risk transmission chains. To that end, we propose several agents that rank individuals based on the role they possess in their interaction network and the epidemic state over which this diffuses, showing that testing or isolating just the top ranked can achieve adequate levels of containment without incurring the costs associated with standard strategies. Additionally, we extensively compare all the policies we derive, and show that a reinforcement learning actor based on graph neural networks outcompetes the more competitive heuristics by up to 15% in the containment rate, while far surpassing the standard random samplers by margins of 50% or more. Finally, we clearly demonstrate the versatility of the learned policies by appraising the decisions taken by the deep learning agent in different contexts using a diverse set of prediction explanation and state visualization techniques.
epidemic-control, reinforcement-learning, target-test-and-trace;, target test and trace, epidemic control, reinforcement learning
1867-8211
174-206
Springer Cham
Rusu, Andrei
901b9bc5-f776-4046-b694-e302c40c31b3
Farrahi, Kate
bc848b9c-fc32-475c-b241-f6ade8babacb
Niranjan, Mahesan
5cbaeea8-7288-4b55-a89c-c43d212ddd4f
Tsanas, A.
Triantafyllidis, A.
Rusu, Andrei
901b9bc5-f776-4046-b694-e302c40c31b3
Farrahi, Kate
bc848b9c-fc32-475c-b241-f6ade8babacb
Niranjan, Mahesan
5cbaeea8-7288-4b55-a89c-c43d212ddd4f
Tsanas, A.
Triantafyllidis, A.

Rusu, Andrei, Farrahi, Kate and Niranjan, Mahesan (2023) Flattening the curve through reinforcement learning driven test and trace policies. Tsanas, A. and Triantafyllidis, A. (eds.) In International Conference on Pervasive Computing Technologies for Healthcare. vol. 488 LNICST, Springer Cham. pp. 174-206 . (doi:10.1007/978-3-031-34586-9_14).

Record type: Conference or Workshop Item (Paper)

Abstract

An effective way of limiting the diffusion of viruses when vaccines are unavailable or insufficiently potent to eradicate them is through running widespread “test and trace” programmes. Although these have been instrumental during the COVID-19 pandemic, they also lead to significant increases in public spending and societal disruptions caused by the numerous isolation requirements. What is more, after the health measures were relaxed across the world, these programmes were unable to prevent substantial upsurges in infections. Here we propose an alternative approach to conducting pathogen testing and contact tracing that is adaptable to the budgeting requirements and risk tolerances of regional policy makers, while still breaking the high risk transmission chains. To that end, we propose several agents that rank individuals based on the role they possess in their interaction network and the epidemic state over which this diffuses, showing that testing or isolating just the top ranked can achieve adequate levels of containment without incurring the costs associated with standard strategies. Additionally, we extensively compare all the policies we derive, and show that a reinforcement learning actor based on graph neural networks outcompetes the more competitive heuristics by up to 15% in the containment rate, while far surpassing the standard random samplers by margins of 50% or more. Finally, we clearly demonstrate the versatility of the learned policies by appraising the decisions taken by the deep learning agent in different contexts using a diverse set of prediction explanation and state visualization techniques.

Text
FlatteningTheCurveRL_EAIPervasiveHealth22 - Accepted Manuscript
Download (1MB)

More information

Accepted/In Press date: 29 October 2022
e-pub ahead of print date: 11 June 2023
Published date: 6 November 2023
Venue - Dates: 16th EAI International Conference on Pervasive Computing Technologies for Healthcare, , Thessaloniki, Greece, 2022-12-12 - 2022-12-14
Keywords: epidemic-control, reinforcement-learning, target-test-and-trace;, target test and trace, epidemic control, reinforcement learning

Identifiers

Local EPrints ID: 473363
URI: http://eprints.soton.ac.uk/id/eprint/473363
ISSN: 1867-8211
PURE UUID: 1ecad9ee-e411-4de5-9c84-7e73bd89271b
ORCID for Andrei Rusu: ORCID iD orcid.org/0000-0002-6053-1685
ORCID for Kate Farrahi: ORCID iD orcid.org/0000-0001-6775-127X
ORCID for Mahesan Niranjan: ORCID iD orcid.org/0000-0001-7021-140X

Catalogue record

Date deposited: 17 Jan 2023 17:32
Last modified: 06 Nov 2024 05:01

Export record

Altmetrics

Contributors

Author: Andrei Rusu ORCID iD
Author: Kate Farrahi ORCID iD
Author: Mahesan Niranjan ORCID iD
Editor: A. Tsanas
Editor: A. Triantafyllidis

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×