The University of Southampton
University of Southampton Institutional Repository

Clinical potential of whole-genome data linked to mortality statistics in patients with breast cancer in the UK: a retrospective analysis

Clinical potential of whole-genome data linked to mortality statistics in patients with breast cancer in the UK: a retrospective analysis
Clinical potential of whole-genome data linked to mortality statistics in patients with breast cancer in the UK: a retrospective analysis

Background: breast cancer is the most frequently diagnosed cancer in women. Survival is generally considered favourable, yet some patients remain at risk of early death. We aimed to assess whether comprehensive whole-genome sequencing (WGS) linked to mortality data could add prognostic value to existing clinical measures and identify patients who might respond to targeted therapeutics.

Methods: in this integrative, retrospective analysis, we analysed 2445 breast cancer tumours (any stage and molecular subtype) collected from 2403 patients recruited through 13 National Health Service Genomic Medicine Centres or hospitals in England affiliated to the 100 000 Genomes Project (100kGP) between 2012 and 2018. We linked 2208 (90%) cases with clinical data; mortality data were obtained for 1188 patients. Following high-depth WGS of tumour and matched normal DNA, we performed comprehensive WGS profiling seeking driver mutations, mutational signatures, and compound algorithmic scores for homologous recombination repair deficiency (HRD), mismatch repair deficiency, and tumour mutational burden. Data from 1803 additional patients with breast cancer from three independent cohorts were used to validate various findings. To evaluate the prognostic value of WGS features, we performed univariable and multivariable Cox regression on data from patients with stage I-III, ER-positive, HER2-negative breast cancer with a cancer-specific mortality endpoint (around 5-year follow-up).

Findings: among 2445 tumours in the 100kGP breast cancer cohort, we observed genomic characteristics with immediate personalised medicine potential in 656 (26·8%), including features reporting HRD (298 [12·2%] total cases and 76 [6·3%] ER-positive, HER2-negative cases), highly individualised driver events, mutations underpinning resistance to endocrine therapy, and mutational signatures indicating therapeutic vulnerabilities. 373 (15·2%) cases had WGS features with potential for translational research, including compromised base excision repair and non-homologous end-joining dependency. Structural variation burden (hazard ratio 3·9 [95 CI% 2·4-6·2]; p<0·0001), high levels of APOBEC signatures (2·5 [1·6-4·1]; p<0·0001), and TP53 drivers (3·9 [2·4-6·2]; p<0·0001) were independently prognostic of customary clinical measures (age at diagnosis, stage, and grade) in patients with ER-positive, HER2-negative breast cancer. We developed a prognosticator for ER-positive, HER2-negative breast cancer capable of identifying patients who require either increased intervention or therapy de-escalation, validating the framework in the independent Swedish Cancerome Analysis Network-Breast (SCAN-B) dataset.

Interpretation: we show that breast cancer genomes are rich in predictive and prognostic value. We propose a two-step model for effective clinical application. First, the identification of candidates for targeted therapies or clinical trials using highly individualised genomic markers. Second, for patients without such features, the implementation of enhanced prognostication using genomic features alongside existing clinical decision-making factors.

Funding: National Institute of Health Research, Breast Cancer Research Foundation, Dr Josef Steiner Cancer Research Award 2019, Basser Gray Prime Award 2020, Cancer Research UK, Sir Jeffrey Cheah Early Career Fellowship, the Mats Paulsson Foundation, the Fru Berta Kamprads Foundation, and the Swedish Research Council.

Adult, Aged, Biomarkers, Tumor/genetics, Breast Neoplasms/genetics, Female, Humans, Middle Aged, Mutation, Prognosis, Retrospective Studies, United Kingdom/epidemiology, Whole Genome Sequencing
1470-2045
1417-1431
Black, Daniella
a845c1f7-fa59-4972-a706-efa09fb2512d
Davies, Helen Ruth
66bae54c-99a2-49f2-b69c-2eef7b7362c8
Koh, Gene Ching Chiek
36f2fafd-02e5-4e42-8b65-8f4b13fe46cf
Chmelova, Lucia
910ffb8a-a9ed-4e1b-89ce-5feadbb111e6
Cubric, Marko
57f7caee-26b3-4bc0-a20e-bcdda49aa6a6
Chalivelaki Chan, Georgia
d45cf2c1-429e-4982-854e-191f1031c587
Degasperi, Andrea
201dafca-34bf-4e0a-a120-5c5003ab5938
Czarnecki, Jan
d890174a-e739-4adb-b111-4c9b84ad2baa
Toong, Ping Jing
1ba69dfc-ac80-495a-bbfa-e97fb7dc79de
Memari, Yasin
0e0f2fa5-d40d-49c6-adc4-2ddd17a36b91
Whitworth, James
6e3c195d-f457-41d9-bc9f-cc10e7bcde00
Zhao, Salome Jingchen
502b9a6b-dc28-4644-b2ee-ec324db5932b
Kumar, Yogesh
ed9d3c5c-7b40-4582-82ad-125f49a9f071
Basyuni, Shadi
c8dc210e-45c3-4494-9359-47c6875a3190
Rinaldi, Giuseppe
8ff3e225-b5d6-4d45-8a25-adae3df8bf4b
Shooter, Scott
e28cdf86-e46e-44da-972a-07de010f5f9d
Dembrovskyi, Vladyslav
cfc95337-ef28-4227-911d-463f3aa37b7c
Davies, Rosie
8782c592-8112-404b-a953-573509faecf8
Chatzou Dunford, Maria
8acd317e-a538-44f4-83c7-5cf19a2d6d2e
Copson, Ellen
a94cdbd6-f6e2-429d-a7c0-462c7da0e92b
Palmieri, Carlo
e6321c8b-c189-4173-ac81-05af5b20c611
Borg, Åke
8edd8af8-210e-4f75-bb33-d55be3f14513
Ambrose, John
bb4c6caa-2a38-4cb9-9a45-f386e23ddf33
Bunce, Catey
7c13264f-265d-4b0f-8a3b-e29fefd55df2
Sosinsky, Alona
7b95221b-1dca-4cb4-85cf-f4be2052e217
Arumugam, Prabhu
91b24ee5-5dc1-4bbc-888c-fc5d2292bb09
Brown, Matthew Arthur
9c052284-9158-479e-b392-e824d36757cb
Staaf, Johan
199d68e3-de9d-4af8-b612-5daa4da17679
Turner, Nicholas
fb542c87-39a3-4286-b958-b11db096442f
Nik-Zainal, Serena
ac473a59-43d8-48df-9eb0-4ee591caef27
et al.
Black, Daniella
a845c1f7-fa59-4972-a706-efa09fb2512d
Davies, Helen Ruth
66bae54c-99a2-49f2-b69c-2eef7b7362c8
Koh, Gene Ching Chiek
36f2fafd-02e5-4e42-8b65-8f4b13fe46cf
Chmelova, Lucia
910ffb8a-a9ed-4e1b-89ce-5feadbb111e6
Cubric, Marko
57f7caee-26b3-4bc0-a20e-bcdda49aa6a6
Chalivelaki Chan, Georgia
d45cf2c1-429e-4982-854e-191f1031c587
Degasperi, Andrea
201dafca-34bf-4e0a-a120-5c5003ab5938
Czarnecki, Jan
d890174a-e739-4adb-b111-4c9b84ad2baa
Toong, Ping Jing
1ba69dfc-ac80-495a-bbfa-e97fb7dc79de
Memari, Yasin
0e0f2fa5-d40d-49c6-adc4-2ddd17a36b91
Whitworth, James
6e3c195d-f457-41d9-bc9f-cc10e7bcde00
Zhao, Salome Jingchen
502b9a6b-dc28-4644-b2ee-ec324db5932b
Kumar, Yogesh
ed9d3c5c-7b40-4582-82ad-125f49a9f071
Basyuni, Shadi
c8dc210e-45c3-4494-9359-47c6875a3190
Rinaldi, Giuseppe
8ff3e225-b5d6-4d45-8a25-adae3df8bf4b
Shooter, Scott
e28cdf86-e46e-44da-972a-07de010f5f9d
Dembrovskyi, Vladyslav
cfc95337-ef28-4227-911d-463f3aa37b7c
Davies, Rosie
8782c592-8112-404b-a953-573509faecf8
Chatzou Dunford, Maria
8acd317e-a538-44f4-83c7-5cf19a2d6d2e
Copson, Ellen
a94cdbd6-f6e2-429d-a7c0-462c7da0e92b
Palmieri, Carlo
e6321c8b-c189-4173-ac81-05af5b20c611
Borg, Åke
8edd8af8-210e-4f75-bb33-d55be3f14513
Ambrose, John
bb4c6caa-2a38-4cb9-9a45-f386e23ddf33
Bunce, Catey
7c13264f-265d-4b0f-8a3b-e29fefd55df2
Sosinsky, Alona
7b95221b-1dca-4cb4-85cf-f4be2052e217
Arumugam, Prabhu
91b24ee5-5dc1-4bbc-888c-fc5d2292bb09
Brown, Matthew Arthur
9c052284-9158-479e-b392-e824d36757cb
Staaf, Johan
199d68e3-de9d-4af8-b612-5daa4da17679
Turner, Nicholas
fb542c87-39a3-4286-b958-b11db096442f
Nik-Zainal, Serena
ac473a59-43d8-48df-9eb0-4ee591caef27

Black, Daniella, Davies, Helen Ruth and Koh, Gene Ching Chiek , et al. (2025) Clinical potential of whole-genome data linked to mortality statistics in patients with breast cancer in the UK: a retrospective analysis. Lancet Oncology, 26 (11), 1417-1431. (doi:10.1016/S1470-2045(25)00400-0).

Record type: Article

Abstract

Background: breast cancer is the most frequently diagnosed cancer in women. Survival is generally considered favourable, yet some patients remain at risk of early death. We aimed to assess whether comprehensive whole-genome sequencing (WGS) linked to mortality data could add prognostic value to existing clinical measures and identify patients who might respond to targeted therapeutics.

Methods: in this integrative, retrospective analysis, we analysed 2445 breast cancer tumours (any stage and molecular subtype) collected from 2403 patients recruited through 13 National Health Service Genomic Medicine Centres or hospitals in England affiliated to the 100 000 Genomes Project (100kGP) between 2012 and 2018. We linked 2208 (90%) cases with clinical data; mortality data were obtained for 1188 patients. Following high-depth WGS of tumour and matched normal DNA, we performed comprehensive WGS profiling seeking driver mutations, mutational signatures, and compound algorithmic scores for homologous recombination repair deficiency (HRD), mismatch repair deficiency, and tumour mutational burden. Data from 1803 additional patients with breast cancer from three independent cohorts were used to validate various findings. To evaluate the prognostic value of WGS features, we performed univariable and multivariable Cox regression on data from patients with stage I-III, ER-positive, HER2-negative breast cancer with a cancer-specific mortality endpoint (around 5-year follow-up).

Findings: among 2445 tumours in the 100kGP breast cancer cohort, we observed genomic characteristics with immediate personalised medicine potential in 656 (26·8%), including features reporting HRD (298 [12·2%] total cases and 76 [6·3%] ER-positive, HER2-negative cases), highly individualised driver events, mutations underpinning resistance to endocrine therapy, and mutational signatures indicating therapeutic vulnerabilities. 373 (15·2%) cases had WGS features with potential for translational research, including compromised base excision repair and non-homologous end-joining dependency. Structural variation burden (hazard ratio 3·9 [95 CI% 2·4-6·2]; p<0·0001), high levels of APOBEC signatures (2·5 [1·6-4·1]; p<0·0001), and TP53 drivers (3·9 [2·4-6·2]; p<0·0001) were independently prognostic of customary clinical measures (age at diagnosis, stage, and grade) in patients with ER-positive, HER2-negative breast cancer. We developed a prognosticator for ER-positive, HER2-negative breast cancer capable of identifying patients who require either increased intervention or therapy de-escalation, validating the framework in the independent Swedish Cancerome Analysis Network-Breast (SCAN-B) dataset.

Interpretation: we show that breast cancer genomes are rich in predictive and prognostic value. We propose a two-step model for effective clinical application. First, the identification of candidates for targeted therapies or clinical trials using highly individualised genomic markers. Second, for patients without such features, the implementation of enhanced prognostication using genomic features alongside existing clinical decision-making factors.

Funding: National Institute of Health Research, Breast Cancer Research Foundation, Dr Josef Steiner Cancer Research Award 2019, Basser Gray Prime Award 2020, Cancer Research UK, Sir Jeffrey Cheah Early Career Fellowship, the Mats Paulsson Foundation, the Fru Berta Kamprads Foundation, and the Swedish Research Council.

Text
1-s2.0-S1470204525004000-main - Version of Record
Available under License Creative Commons Attribution.
Download (3MB)

More information

e-pub ahead of print date: 7 October 2025
Published date: 27 October 2025
Keywords: Adult, Aged, Biomarkers, Tumor/genetics, Breast Neoplasms/genetics, Female, Humans, Middle Aged, Mutation, Prognosis, Retrospective Studies, United Kingdom/epidemiology, Whole Genome Sequencing

Identifiers

Local EPrints ID: 508275
URI: http://eprints.soton.ac.uk/id/eprint/508275
ISSN: 1470-2045
PURE UUID: 904370c2-0225-417c-b883-3b1aa683d317

Catalogue record

Date deposited: 15 Jan 2026 18:09
Last modified: 15 Jan 2026 18:09

Export record

Altmetrics

Contributors

Author: Daniella Black
Author: Helen Ruth Davies
Author: Gene Ching Chiek Koh
Author: Lucia Chmelova
Author: Marko Cubric
Author: Georgia Chalivelaki Chan
Author: Andrea Degasperi
Author: Jan Czarnecki
Author: Ping Jing Toong
Author: Yasin Memari
Author: James Whitworth
Author: Salome Jingchen Zhao
Author: Yogesh Kumar
Author: Shadi Basyuni
Author: Giuseppe Rinaldi
Author: Scott Shooter
Author: Vladyslav Dembrovskyi
Author: Rosie Davies
Author: Maria Chatzou Dunford
Author: Ellen Copson
Author: Carlo Palmieri
Author: Åke Borg
Author: John Ambrose
Author: Catey Bunce
Author: Alona Sosinsky
Author: Prabhu Arumugam
Author: Matthew Arthur Brown
Author: Johan Staaf
Author: Nicholas Turner
Author: Serena Nik-Zainal
Corporate Author: et al.

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×