The University of Southampton
University of Southampton Institutional Repository

Image-based consensus molecular subtype (imCMS) classification of colorectal cancer using deep learning

Image-based consensus molecular subtype (imCMS) classification of colorectal cancer using deep learning
Image-based consensus molecular subtype (imCMS) classification of colorectal cancer using deep learning

Objective Complex phenotypes captured on histological slides represent the biological processes at play in individual cancers, but the link to underlying molecular classification has not been clarified or systematised. In colorectal cancer (CRC), histological grading is a poor predictor of disease progression, and consensus molecular subtypes (CMSs) cannot be distinguished without gene expression profiling. We hypothesise that image analysis is a cost-effective tool to associate complex features of tissue organisation with molecular and outcome data and to resolve unclassifiable or heterogeneous cases. In this study, we present an image-based approach to predict CRC CMS from standard H&E sections using deep learning. Design Training and evaluation of a neural network were performed using a total of n=1206 tissue sections with comprehensive multi-omic data from three independent datasets (training on FOCUS trial, n=278 patients; test on rectal cancer biopsies, GRAMPIAN cohort, n=144 patients; and The Cancer Genome Atlas (TCGA), n=430 patients). Ground truth CMS calls were ascertained by matching random forest and single sample predictions from CMS classifier. Results Image-based CMS (imCMS) accurately classified slides in unseen datasets from TCGA (n=431 slides, AUC)=0.84) and rectal cancer biopsies (n=265 slides, AUC=0.85). imCMS spatially resolved intratumoural heterogeneity and provided secondary calls correlating with bioinformatic prediction from molecular data. imCMS classified samples previously unclassifiable by RNA expression profiling, reproduced the expected correlations with genomic and epigenetic alterations and showed similar prognostic associations as transcriptomic CMS. Conclusion This study shows that a prediction of RNA expression classifiers can be made from H&E images, opening the door to simple, cheap and reliable biological stratification within routine workflows.

colorectal pathology, computerised image analysis, molecular pathology, Predictive Value of Tests, Datasets as Topic, Prognosis, Humans, Colorectal Neoplasms/genetics, Gene Expression Profiling, Consensus, Deep Learning, Disease Progression, Biomarkers, Tumor/genetics, RNA/genetics, Phenotype, Neoplasm Grading, Biopsy, Gene Expression Regulation, Neoplastic/genetics
0017-5749
544-554
Sirinukunwattana, Korsuk
481c6281-3e38-469d-8a1b-44b724cb3942
Domingo, Enric
10314c45-ec5d-443a-92c7-9d61f5e8169a
Richman, Susan D.
aae8a630-3e31-472b-9976-55c7f621ff55
Redmond, Keara L.
1970045f-02ae-4e28-a296-23ecfc888c16
Blake, Andrew
c2875bc5-7ce5-4d4c-a7c9-a1cb15e40a56
Verrill, Clare
aebaa0d4-fb62-4bf6-97ab-5f9bb23cf9a3
Leedham, Simon J.
636b8685-c45a-4615-bd89-0d4e0b6a1e7c
Chatzipli, Aikaterini
a71ae3f1-dea5-48c6-b6aa-f04554cafd1c
Hardy, Claire
be990bf5-e565-41d9-b9ad-60796348bfcf
Whalley, Celina M.
64c2a23b-31f6-437e-be9c-d1668edee99b
Wu, Chieh Hsi
ace630c6-2095-4ade-b657-241692f6b4d3
Beggs, Andrew D.
fe399221-77bc-4be2-b45e-739e1d889581
McDermott, Ultan
3f415741-9a19-482f-baf0-67f43b9312aa
Dunne, Philip D.
2ff6ea41-7610-40a8-90d0-fa6dea662cc6
Meade, Angela
8b7678c2-d941-4c4b-abef-bbe15a574864
Walker, Steven M.
c470c42e-1400-490c-905d-ef1d66037713
Murray, Graeme I.
35ca95fd-3168-4db6-a842-531ad24812f5
Samuel, Leslie
f72eeceb-c97a-40d5-bdfa-42c3ec52c413
Seymour, Matthew
398287ed-4467-487d-b851-b43ceefc2c5e
Tomlinson, Ian
e3593fea-8b8c-44d8-a922-f7d8f0b62a23
Quirke, Phil
e0a608ee-6f6a-4972-b7ff-2e94780fcc56
Maughan, Timothy
195bfe69-ebc3-4093-94fc-599733589530
Rittscher, Jens
a2037278-6051-4ee9-a16e-b7707dc9bbdc
Koelzer, Viktor H.
13492523-a903-4e01-9e08-8cb6f15fabcb
Sirinukunwattana, Korsuk
481c6281-3e38-469d-8a1b-44b724cb3942
Domingo, Enric
10314c45-ec5d-443a-92c7-9d61f5e8169a
Richman, Susan D.
aae8a630-3e31-472b-9976-55c7f621ff55
Redmond, Keara L.
1970045f-02ae-4e28-a296-23ecfc888c16
Blake, Andrew
c2875bc5-7ce5-4d4c-a7c9-a1cb15e40a56
Verrill, Clare
aebaa0d4-fb62-4bf6-97ab-5f9bb23cf9a3
Leedham, Simon J.
636b8685-c45a-4615-bd89-0d4e0b6a1e7c
Chatzipli, Aikaterini
a71ae3f1-dea5-48c6-b6aa-f04554cafd1c
Hardy, Claire
be990bf5-e565-41d9-b9ad-60796348bfcf
Whalley, Celina M.
64c2a23b-31f6-437e-be9c-d1668edee99b
Wu, Chieh Hsi
ace630c6-2095-4ade-b657-241692f6b4d3
Beggs, Andrew D.
fe399221-77bc-4be2-b45e-739e1d889581
McDermott, Ultan
3f415741-9a19-482f-baf0-67f43b9312aa
Dunne, Philip D.
2ff6ea41-7610-40a8-90d0-fa6dea662cc6
Meade, Angela
8b7678c2-d941-4c4b-abef-bbe15a574864
Walker, Steven M.
c470c42e-1400-490c-905d-ef1d66037713
Murray, Graeme I.
35ca95fd-3168-4db6-a842-531ad24812f5
Samuel, Leslie
f72eeceb-c97a-40d5-bdfa-42c3ec52c413
Seymour, Matthew
398287ed-4467-487d-b851-b43ceefc2c5e
Tomlinson, Ian
e3593fea-8b8c-44d8-a922-f7d8f0b62a23
Quirke, Phil
e0a608ee-6f6a-4972-b7ff-2e94780fcc56
Maughan, Timothy
195bfe69-ebc3-4093-94fc-599733589530
Rittscher, Jens
a2037278-6051-4ee9-a16e-b7707dc9bbdc
Koelzer, Viktor H.
13492523-a903-4e01-9e08-8cb6f15fabcb

Sirinukunwattana, Korsuk, Domingo, Enric, Richman, Susan D., Redmond, Keara L., Blake, Andrew, Verrill, Clare, Leedham, Simon J., Chatzipli, Aikaterini, Hardy, Claire, Whalley, Celina M., Wu, Chieh Hsi, Beggs, Andrew D., McDermott, Ultan, Dunne, Philip D., Meade, Angela, Walker, Steven M., Murray, Graeme I., Samuel, Leslie, Seymour, Matthew, Tomlinson, Ian, Quirke, Phil, Maughan, Timothy, Rittscher, Jens and Koelzer, Viktor H. (2021) Image-based consensus molecular subtype (imCMS) classification of colorectal cancer using deep learning. Gut, 70 (3), 544-554. (doi:10.1136/gutjnl-2019-319866).

Record type: Article

Abstract

Objective Complex phenotypes captured on histological slides represent the biological processes at play in individual cancers, but the link to underlying molecular classification has not been clarified or systematised. In colorectal cancer (CRC), histological grading is a poor predictor of disease progression, and consensus molecular subtypes (CMSs) cannot be distinguished without gene expression profiling. We hypothesise that image analysis is a cost-effective tool to associate complex features of tissue organisation with molecular and outcome data and to resolve unclassifiable or heterogeneous cases. In this study, we present an image-based approach to predict CRC CMS from standard H&E sections using deep learning. Design Training and evaluation of a neural network were performed using a total of n=1206 tissue sections with comprehensive multi-omic data from three independent datasets (training on FOCUS trial, n=278 patients; test on rectal cancer biopsies, GRAMPIAN cohort, n=144 patients; and The Cancer Genome Atlas (TCGA), n=430 patients). Ground truth CMS calls were ascertained by matching random forest and single sample predictions from CMS classifier. Results Image-based CMS (imCMS) accurately classified slides in unseen datasets from TCGA (n=431 slides, AUC)=0.84) and rectal cancer biopsies (n=265 slides, AUC=0.85). imCMS spatially resolved intratumoural heterogeneity and provided secondary calls correlating with bioinformatic prediction from molecular data. imCMS classified samples previously unclassifiable by RNA expression profiling, reproduced the expected correlations with genomic and epigenetic alterations and showed similar prognostic associations as transcriptomic CMS. Conclusion This study shows that a prediction of RNA expression classifiers can be made from H&E images, opening the door to simple, cheap and reliable biological stratification within routine workflows.

Text
544.full - Version of Record
Available under License Creative Commons Attribution.
Download (7MB)

More information

Accepted/In Press date: 8 June 2020
e-pub ahead of print date: 20 July 2020
Published date: 1 March 2021
Additional Information: Publisher Copyright: © Copyright: Copyright 2021 Elsevier B.V., All rights reserved.
Keywords: colorectal pathology, computerised image analysis, molecular pathology, Predictive Value of Tests, Datasets as Topic, Prognosis, Humans, Colorectal Neoplasms/genetics, Gene Expression Profiling, Consensus, Deep Learning, Disease Progression, Biomarkers, Tumor/genetics, RNA/genetics, Phenotype, Neoplasm Grading, Biopsy, Gene Expression Regulation, Neoplastic/genetics

Identifiers

Local EPrints ID: 455511
URI: http://eprints.soton.ac.uk/id/eprint/455511
ISSN: 0017-5749
PURE UUID: 51c15c91-b0ff-41d6-b3fb-788473b68ae2
ORCID for Chieh Hsi Wu: ORCID iD orcid.org/0000-0001-9386-725X

Catalogue record

Date deposited: 24 Mar 2022 17:32
Last modified: 17 Mar 2024 04:00

Export record

Altmetrics

Contributors

Author: Korsuk Sirinukunwattana
Author: Enric Domingo
Author: Susan D. Richman
Author: Keara L. Redmond
Author: Andrew Blake
Author: Clare Verrill
Author: Simon J. Leedham
Author: Aikaterini Chatzipli
Author: Claire Hardy
Author: Celina M. Whalley
Author: Chieh Hsi Wu ORCID iD
Author: Andrew D. Beggs
Author: Ultan McDermott
Author: Philip D. Dunne
Author: Angela Meade
Author: Steven M. Walker
Author: Graeme I. Murray
Author: Leslie Samuel
Author: Matthew Seymour
Author: Ian Tomlinson
Author: Phil Quirke
Author: Timothy Maughan
Author: Jens Rittscher
Author: Viktor H. Koelzer

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×