The University of Southampton
University of Southampton Institutional Repository

Applications of machine learning for the diagnosis of sleep apnoea and the causal analysis of sleep studies

Applications of machine learning for the diagnosis of sleep apnoea and the causal analysis of sleep studies
Applications of machine learning for the diagnosis of sleep apnoea and the causal analysis of sleep studies
This thesis is an exploration of the applications of machine learning (ML) to issues relating to sleep apnoea. A person with sleep apnoea suffers apnoeas (cessation of breathing) and hypopnoeas (less severe events) during sleep. It is a common condition that is associated with negative health outcomes. The aims of the study are to reduce the cost and difficulty of diagnosing sleep apnoea and to uncover the mechanisms of sleep using causal discovery. Subsequently, contributions are made to methods of causal discovery.

Patients with suspected sleep apnoea are required to undertake a sleep study, in which a number of continuous measurements are taken overnight. The data from the study are then assessed by a specialist. This carries a significant cost, due to the equipment required and the demands on clinicians' time, limiting the capacity of healthcare providers to deal with this serious and underdiagnosed condition. It would save resources if this process were automated using machine learning. In addition, clinicians would like to use a standalone pulse oximeter for this, rather than the full sleep study kit. In this thesis, ML models are trained to classify windows of a pulse oximetry trace according to whether or not they contain an apnoea or hypopnoea, and whether or not the patient is asleep. It is demonstrated that models trained on data from the full kit can achieve good results on data from the standalone pulse oximeter. Contrary to expectations, the prediction error from these experiments is not associated with the perfusion index (PI), a measure used to quantify signal quality.

Greater understanding of the physiological mechanisms of sleep could improve treatment. Accordingly, causal reasoning is applied to this field of study. Firstly, Granger causality is applied to a large sleep study dataset. DYNOTEARS, a recently-introduced method of structure learning by continuous optimisation, is then applied to the same data, and the results are compared. With both methods, an association is found between the girth of a person's waist and the presence of a causal link between the heart and the brain.

DYNOTEARS is highly dependent on the choice of hyperparameters, a fact which is often overlooked in the literature. This is demonstrated, and it is shown that Bayesian optimisation can find these hyperparameters quickly and reliably. This improves the ease of use of this method.

The development of time-series structure learning models is limited by the lack of benchmarking data. This data must be synthetic so that the true causal structure is known, but it is challenging to create stable models for this purpose. Initial work is presented towards a new technique to randomly generate stable models for benchmarking structure learning methods.
University of Southampton
Thomas, Alexander Edward
cf9b912d-19f4-41f0-9b3c-47eddb240ae6
Thomas, Alexander Edward
cf9b912d-19f4-41f0-9b3c-47eddb240ae6
Niranjan, Mahesan
5cbaeea8-7288-4b55-a89c-c43d212ddd4f
Legg, Julian
72afc985-52a4-4504-84c4-50a8e018891f
MacArthur, Ben
4fa2fa9d-b8e5-48b3-b98d-930e1c7f7fff

Thomas, Alexander Edward (2026) Applications of machine learning for the diagnosis of sleep apnoea and the causal analysis of sleep studies. University of Southampton, Doctoral Thesis, 134pp.

Record type: Thesis (Doctoral)

Abstract

This thesis is an exploration of the applications of machine learning (ML) to issues relating to sleep apnoea. A person with sleep apnoea suffers apnoeas (cessation of breathing) and hypopnoeas (less severe events) during sleep. It is a common condition that is associated with negative health outcomes. The aims of the study are to reduce the cost and difficulty of diagnosing sleep apnoea and to uncover the mechanisms of sleep using causal discovery. Subsequently, contributions are made to methods of causal discovery.

Patients with suspected sleep apnoea are required to undertake a sleep study, in which a number of continuous measurements are taken overnight. The data from the study are then assessed by a specialist. This carries a significant cost, due to the equipment required and the demands on clinicians' time, limiting the capacity of healthcare providers to deal with this serious and underdiagnosed condition. It would save resources if this process were automated using machine learning. In addition, clinicians would like to use a standalone pulse oximeter for this, rather than the full sleep study kit. In this thesis, ML models are trained to classify windows of a pulse oximetry trace according to whether or not they contain an apnoea or hypopnoea, and whether or not the patient is asleep. It is demonstrated that models trained on data from the full kit can achieve good results on data from the standalone pulse oximeter. Contrary to expectations, the prediction error from these experiments is not associated with the perfusion index (PI), a measure used to quantify signal quality.

Greater understanding of the physiological mechanisms of sleep could improve treatment. Accordingly, causal reasoning is applied to this field of study. Firstly, Granger causality is applied to a large sleep study dataset. DYNOTEARS, a recently-introduced method of structure learning by continuous optimisation, is then applied to the same data, and the results are compared. With both methods, an association is found between the girth of a person's waist and the presence of a causal link between the heart and the brain.

DYNOTEARS is highly dependent on the choice of hyperparameters, a fact which is often overlooked in the literature. This is demonstrated, and it is shown that Bayesian optimisation can find these hyperparameters quickly and reliably. This improves the ease of use of this method.

The development of time-series structure learning models is limited by the lack of benchmarking data. This data must be synthetic so that the true causal structure is known, but it is challenging to create stable models for this purpose. Initial work is presented towards a new technique to randomly generate stable models for benchmarking structure learning methods.

Text
Alexander Thomas PhD Thesis - Version of Record
Available under License University of Southampton Thesis Licence.
Download (4MB)
Text
Final-thesis-submission-Examination-Mr-Alexander-Thomas
Restricted to Repository staff only

More information

Published date: 2026

Identifiers

Local EPrints ID: 511701
URI: http://eprints.soton.ac.uk/id/eprint/511701
PURE UUID: 4dfa8527-29d6-4aef-8528-38e075932d87
ORCID for Alexander Edward Thomas: ORCID iD orcid.org/0000-0003-2098-9373
ORCID for Mahesan Niranjan: ORCID iD orcid.org/0000-0001-7021-140X

Catalogue record

Date deposited: 28 May 2026 16:46
Last modified: 30 May 2026 02:18

Export record

Altmetrics

Contributors

Author: Alexander Edward Thomas ORCID iD
Thesis advisor: Mahesan Niranjan ORCID iD
Thesis advisor: Julian Legg
Thesis advisor: Ben MacArthur

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×