The University of Southampton
University of Southampton Institutional Repository

Staging by the book: automatic sleep stage classification using scoring rules

Staging by the book: automatic sleep stage classification using scoring rules
Staging by the book: automatic sleep stage classification using scoring rules
Automated sleep staging is commonly approached as a supervised machine learning problem, with deep learning methods dominating recent research. While machine learning models achieve near-human level agreement with human-scored reference sleep stages, their decisions are typically opaque and not designed to follow clinical scoring rules. We propose a transparent alternative: a deterministic, rule-based sleep staging method that explicitly operationalizes the American Academy of Sleep Medicine's (AASM) scoring logic as executable code, coupled with epoch-level natural-language justifications derived from an explanation trace. We evaluate the approach on 50 polysomnography recordings with a 10-scorer majority-vote consensus as reference. Across all recordings, the method agreed with the majority-vote reference in 60.5% of epochs (κ=0.42), with substantially higher agreement on a dataset used during development (77.1%, κ=0.61). Agreement with the reference was highest for sleep stage N2 (recall 83.5%) and moderate for sleep stage R (recall 68.7%), while Wake and N1 recall were low. Despite lower agreement with the reference than contemporary deep learning models, the method provides deterministic decisions and natural language explanations aligned with AASM scoring rules, making it a complementary tool for auditing, debugging, and governing deep learning-based sleep staging.
eess.SP, cs.AI
arXiv
Hardarson, Emil
bbdcb067-9e1b-4995-9340-cab3e48b981e
Popov, Konstantin
bad11dc0-a4c4-4470-bb52-fcf66b060d33
Sigurdardottir, Sigridur
b2681eff-160f-4742-b160-487912bac9e5
Islind, Anna Sigridur
aefc8cda-7d3e-4367-bfca-e9a2261fe87f
Arnardóttir, Erna Sif
9bfbbe32-8214-47a9-86ba-43be85458830
Óskarsdóttir, María
d159ed8f-9dd3-4ff3-8b00-d43579ab71be
Hardarson, Emil
bbdcb067-9e1b-4995-9340-cab3e48b981e
Popov, Konstantin
bad11dc0-a4c4-4470-bb52-fcf66b060d33
Sigurdardottir, Sigridur
b2681eff-160f-4742-b160-487912bac9e5
Islind, Anna Sigridur
aefc8cda-7d3e-4367-bfca-e9a2261fe87f
Arnardóttir, Erna Sif
9bfbbe32-8214-47a9-86ba-43be85458830
Óskarsdóttir, María
d159ed8f-9dd3-4ff3-8b00-d43579ab71be

[Unknown type: UNSPECIFIED]

Record type: UNSPECIFIED

Abstract

Automated sleep staging is commonly approached as a supervised machine learning problem, with deep learning methods dominating recent research. While machine learning models achieve near-human level agreement with human-scored reference sleep stages, their decisions are typically opaque and not designed to follow clinical scoring rules. We propose a transparent alternative: a deterministic, rule-based sleep staging method that explicitly operationalizes the American Academy of Sleep Medicine's (AASM) scoring logic as executable code, coupled with epoch-level natural-language justifications derived from an explanation trace. We evaluate the approach on 50 polysomnography recordings with a 10-scorer majority-vote consensus as reference. Across all recordings, the method agreed with the majority-vote reference in 60.5% of epochs (κ=0.42), with substantially higher agreement on a dataset used during development (77.1%, κ=0.61). Agreement with the reference was highest for sleep stage N2 (recall 83.5%) and moderate for sleep stage R (recall 68.7%), while Wake and N1 recall were low. Despite lower agreement with the reference than contemporary deep learning models, the method provides deterministic decisions and natural language explanations aligned with AASM scoring rules, making it a complementary tool for auditing, debugging, and governing deep learning-based sleep staging.

Text
2605.22859v1 - Author's Original
Download (829kB)

More information

Published date: 19 May 2026
Keywords: eess.SP, cs.AI

Identifiers

Local EPrints ID: 511794
URI: http://eprints.soton.ac.uk/id/eprint/511794
PURE UUID: a74fa111-6fd5-4027-8979-8c07a8b606e7
ORCID for María Óskarsdóttir: ORCID iD orcid.org/0000-0001-5095-5356

Catalogue record

Date deposited: 02 Jun 2026 16:52
Last modified: 03 Jun 2026 02:12

Export record

Altmetrics

Contributors

Author: Emil Hardarson
Author: Konstantin Popov
Author: Sigridur Sigurdardottir
Author: Anna Sigridur Islind
Author: Erna Sif Arnardóttir
Author: María Óskarsdóttir ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×