The University of Southampton
University of Southampton Institutional Repository

Deep cascade learning for optimal medical image feature representation

Deep cascade learning for optimal medical image feature representation
Deep cascade learning for optimal medical image feature representation

Cascade Learning (CL) is a new and alternative form of training a deep neural network in a layer-wise fashion. This varied training strategy results in different feature representations, advantageous due to the incremental complexity induced across layers of the network. We hypothesize that CL is inducing coarse-to-fine feature representations across layers of the network, differing from traditional end-to-end learning, advantageous for medical imaging applications. We use five different medical image classification tasks and a feature localisation task to show that CL is a superior learning strategy. We show that transferring cascade learned features from cascade trained models from a subset of ImageNet systematically outperforms transfer from traditional end-to-end training, often with statistical significance, but never worse. We demonstrate visually (using Grad-CAM saliency maps), numerically (using granulometry measures), and with error analysis that the features and also errors across the learning paradigms are different, motivating a combined approach, which we validate further improves performance. We find the features learned using CL are more closely aligned with medical expert labelled regions of interest on a large chest X-ray dataset. We further demonstrate other advantages of CL, such as robustness to noise and improved model calibration, which we suggest future work seriously consider as metrics to optimise, in addition to performance, prior to deployment in clinical settings.

54-78
PMLR
Wang, Junwen
fea12e84-8be0-4c8e-bd53-186dd353d55f
Du, Xin
9629013b-b962-4a81-bf18-7797d581fdd8
Farrahi, Katayoun
bc848b9c-fc32-475c-b241-f6ade8babacb
Niranjan, Mahesan
5cbaeea8-7288-4b55-a89c-c43d212ddd4f
Lipton, Zachary
Ranganath, Rajesh
Sendak, Mark
Sjoding, Michael
Yeung, Serena
Wang, Junwen
fea12e84-8be0-4c8e-bd53-186dd353d55f
Du, Xin
9629013b-b962-4a81-bf18-7797d581fdd8
Farrahi, Katayoun
bc848b9c-fc32-475c-b241-f6ade8babacb
Niranjan, Mahesan
5cbaeea8-7288-4b55-a89c-c43d212ddd4f
Lipton, Zachary
Ranganath, Rajesh
Sendak, Mark
Sjoding, Michael
Yeung, Serena

Wang, Junwen, Du, Xin, Farrahi, Katayoun and Niranjan, Mahesan (2022) Deep cascade learning for optimal medical image feature representation. Lipton, Zachary, Ranganath, Rajesh, Sendak, Mark, Sjoding, Michael and Yeung, Serena (eds.) In Proceedings of the 7th Machine Learning for Healthcare Conference. vol. 182, PMLR. pp. 54-78 .

Record type: Conference or Workshop Item (Paper)

Abstract

Cascade Learning (CL) is a new and alternative form of training a deep neural network in a layer-wise fashion. This varied training strategy results in different feature representations, advantageous due to the incremental complexity induced across layers of the network. We hypothesize that CL is inducing coarse-to-fine feature representations across layers of the network, differing from traditional end-to-end learning, advantageous for medical imaging applications. We use five different medical image classification tasks and a feature localisation task to show that CL is a superior learning strategy. We show that transferring cascade learned features from cascade trained models from a subset of ImageNet systematically outperforms transfer from traditional end-to-end training, often with statistical significance, but never worse. We demonstrate visually (using Grad-CAM saliency maps), numerically (using granulometry measures), and with error analysis that the features and also errors across the learning paradigms are different, motivating a combined approach, which we validate further improves performance. We find the features learned using CL are more closely aligned with medical expert labelled regions of interest on a large chest X-ray dataset. We further demonstrate other advantages of CL, such as robustness to noise and improved model calibration, which we suggest future work seriously consider as metrics to optimise, in addition to performance, prior to deployment in clinical settings.

This record has no associated files available for download.

More information

Published date: 2022
Venue - Dates: 7th Machine Learning for Healthcare Conference, MLHC 2022, , Durham, United States, 2022-08-05 - 2022-08-06

Identifiers

Local EPrints ID: 489969
URI: http://eprints.soton.ac.uk/id/eprint/489969
PURE UUID: eb403826-d392-4fa1-bb11-a02ecfaf7c15
ORCID for Katayoun Farrahi: ORCID iD orcid.org/0000-0001-6775-127X
ORCID for Mahesan Niranjan: ORCID iD orcid.org/0000-0001-7021-140X

Catalogue record

Date deposited: 09 May 2024 16:31
Last modified: 10 May 2024 01:51

Export record

Contributors

Author: Junwen Wang
Author: Xin Du
Author: Katayoun Farrahi ORCID iD
Author: Mahesan Niranjan ORCID iD
Editor: Zachary Lipton
Editor: Rajesh Ranganath
Editor: Mark Sendak
Editor: Michael Sjoding
Editor: Serena Yeung

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×