The University of Southampton
University of Southampton Institutional Repository

An informatics approach to profiling patient experiences using electronic health records: constructing and clustering the burden space of individuals under 65 years of age with multiple long-term conditions

An informatics approach to profiling patient experiences using electronic health records: constructing and clustering the burden space of individuals under 65 years of age with multiple long-term conditions
An informatics approach to profiling patient experiences using electronic health records: constructing and clustering the burden space of individuals under 65 years of age with multiple long-term conditions
Living with multiple long-term conditions (MLTC) profoundly impacts patients’ lives, affecting not only their health but also their financial, emotional, and social well-being. It can impose a significant burden on people. Here we take a novel approach, exploring the lived experience of individuals with MLTC by identifying patterns of burden—spanning physical, emotional, social, and financial domains—using machine learning techniques applied to electronic health records (EHR).

We constructed a cohort of 310,990 individuals born between January 1, 1958, and December 31, 1967, all with two or more long-term conditions. Proxy indicators of burden were extracted from EHR data. Using k-means clustering, we identified subgroups of individuals with distinct burden profiles and analyzed the distribution of burden indicators within each cluster.

Several large clusters were characterized by high prevalence of one or more of pain, anxiety, and depression. Most clusters were predominantly female, with females over-represented compared to the overall burden cohort. Socioeconomic disparities were evident: clusters marked by pain had a higher proportion of individuals from the most deprived areas, while clusters characterised by stress or anxiety alone had a higher proportion of those from the least deprived areas. Certain combinations of burden indicators tended to be over-represented in the same clusters, such as pain with mobility problems, and depression with very high A&E arrivals, and separation/divorce.

This study demonstrates the utility of machine learning for uncovering nuanced, patient-centered patterns in the experience of living with MLTC. The clustering approach reveals how different burdens intersect and vary across demographic and socioeconomic lines, offering insights that could inform more tailored and equitable care strategies.
medRxiv
Shiranirad, Mozhdeh
d25e36f5-e99d-4aec-b1c2-ca17e6d7a490
Zlatev, Zlatko
8f2e3635-d76c-46e2-85b9-53cc223fee01
Chiovoloni, Roberta
593d5cf9-f7c7-4ef9-a459-e627b63b3606
Holland, Emilia
0a16c883-d72d-44bb-b780-0d30b8e06aab
Dylag, Jakub
419a56cd-af18-401e-bd4a-070a4d76270b
Alwan, Nisreen A.
0d37b320-f325-4ed3-ba51-0fe2866d5382
Berrington, Ann
bd0fc093-310d-4236-8126-ca0c7eb9ddde
Boniface, Michael
f30bfd7d-20ed-451b-b405-34e3e22fdfba
Fraser, Simon D.S.
cda9a739-eedc-47e5-9dd7-17dda85349ee
Hoyle, Rebecca B.
e980d6a8-b750-491b-be13-84d695f8b8a1
Shiranirad, Mozhdeh
d25e36f5-e99d-4aec-b1c2-ca17e6d7a490
Zlatev, Zlatko
8f2e3635-d76c-46e2-85b9-53cc223fee01
Chiovoloni, Roberta
593d5cf9-f7c7-4ef9-a459-e627b63b3606
Holland, Emilia
0a16c883-d72d-44bb-b780-0d30b8e06aab
Dylag, Jakub
419a56cd-af18-401e-bd4a-070a4d76270b
Alwan, Nisreen A.
0d37b320-f325-4ed3-ba51-0fe2866d5382
Berrington, Ann
bd0fc093-310d-4236-8126-ca0c7eb9ddde
Boniface, Michael
f30bfd7d-20ed-451b-b405-34e3e22fdfba
Fraser, Simon D.S.
cda9a739-eedc-47e5-9dd7-17dda85349ee
Hoyle, Rebecca B.
e980d6a8-b750-491b-be13-84d695f8b8a1

[Unknown type: UNSPECIFIED]

Record type: UNSPECIFIED

Abstract

Living with multiple long-term conditions (MLTC) profoundly impacts patients’ lives, affecting not only their health but also their financial, emotional, and social well-being. It can impose a significant burden on people. Here we take a novel approach, exploring the lived experience of individuals with MLTC by identifying patterns of burden—spanning physical, emotional, social, and financial domains—using machine learning techniques applied to electronic health records (EHR).

We constructed a cohort of 310,990 individuals born between January 1, 1958, and December 31, 1967, all with two or more long-term conditions. Proxy indicators of burden were extracted from EHR data. Using k-means clustering, we identified subgroups of individuals with distinct burden profiles and analyzed the distribution of burden indicators within each cluster.

Several large clusters were characterized by high prevalence of one or more of pain, anxiety, and depression. Most clusters were predominantly female, with females over-represented compared to the overall burden cohort. Socioeconomic disparities were evident: clusters marked by pain had a higher proportion of individuals from the most deprived areas, while clusters characterised by stress or anxiety alone had a higher proportion of those from the least deprived areas. Certain combinations of burden indicators tended to be over-represented in the same clusters, such as pain with mobility problems, and depression with very high A&E arrivals, and separation/divorce.

This study demonstrates the utility of machine learning for uncovering nuanced, patient-centered patterns in the experience of living with MLTC. The clustering approach reveals how different burdens intersect and vary across demographic and socioeconomic lines, offering insights that could inform more tailored and equitable care strategies.

Text
2025.11.27.25341182v1.full - Author's Original
Available under License Creative Commons Attribution.
Download (6MB)

More information

e-pub ahead of print date: 2 December 2025

Identifiers

Local EPrints ID: 508625
URI: http://eprints.soton.ac.uk/id/eprint/508625
PURE UUID: 6bde6daa-1dcf-4b85-9aa0-e8b215f3999a
ORCID for Mozhdeh Shiranirad: ORCID iD orcid.org/0000-0003-4346-3059
ORCID for Zlatko Zlatev: ORCID iD orcid.org/0000-0001-6275-5626
ORCID for Emilia Holland: ORCID iD orcid.org/0000-0001-5722-3836
ORCID for Jakub Dylag: ORCID iD orcid.org/0000-0001-6263-7339
ORCID for Nisreen A. Alwan: ORCID iD orcid.org/0000-0002-4134-8463
ORCID for Ann Berrington: ORCID iD orcid.org/0000-0002-1683-6668
ORCID for Michael Boniface: ORCID iD orcid.org/0000-0002-9281-6095
ORCID for Rebecca B. Hoyle: ORCID iD orcid.org/0000-0002-1645-1071

Catalogue record

Date deposited: 28 Jan 2026 17:49
Last modified: 03 Feb 2026 02:42

Export record

Altmetrics

Contributors

Author: Mozhdeh Shiranirad ORCID iD
Author: Zlatko Zlatev ORCID iD
Author: Roberta Chiovoloni
Author: Emilia Holland ORCID iD
Author: Jakub Dylag ORCID iD
Author: Ann Berrington ORCID iD
Author: Simon D.S. Fraser

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×