Inference from medical images with linear discriminant analysis and deep learning-derived features
Inference from medical images with linear discriminant analysis and deep learning-derived features
Several challenges arise in the application of modern computer vision and machine learning techniques to making inferences from medical images. They include relatively low sample images, biases inherent in the training data that can act as confounding variables, and the need to incorporate clinical variables about patients. In this dissertation, we combine deep neural networks, pre-trained on large datasets of natural images (e.g. ImageNet) acting as feature extractors, and linear discriminant subspaces to address these issues. We start with our extension to the classic linear discriminant analysis (LDA) to derive multiple mutually orthogonal discriminant directions in the multi-class discriminant subspace, which we refer to as generalised optimal LDA (GO-LDA). Unlike previous work in the field of LDA, the discriminability of the subspace we derive is not limited by the number of classes in the multi-class problem. Empirical work on 14 datasets covering 11 different disease domains shows the advantage of this discriminant subspace approach in a few-shot learning setting. Furthermore, bias in data, arising from confounding information from protected characteristics (e.g. gender or skin tone color), which can lead to unacceptable decision barriers in society, is of serious concern in medical inference problems. Here, we show how an approach biased towards deriving orthogonal discriminant directions, whereby one direction separates the protected characteristic and one separates disease state, can effectively address this issue. This is demonstrated using dermatology and chest X-ray problems in which skin tone color and gender induce confounding issues. Finally, we demonstrate that appending relevant clinical variables in the reduced discriminant subspace is effective, as demonstrated using tuberculosis and dermatology datasets.
University of Southampton
Liu, Jiahui
53a77bdd-58f3-455a-b4a8-2ee9927a0027
September 2024
Liu, Jiahui
53a77bdd-58f3-455a-b4a8-2ee9927a0027
Cai, Xiaohao
de483445-45e9-4b21-a4e8-b0427fc72cee
Niranjan, Mahesan
5cbaeea8-7288-4b55-a89c-c43d212ddd4f
Liu, Jiahui
(2024)
Inference from medical images with linear discriminant analysis and deep learning-derived features.
University of Southampton, Doctoral Thesis, 119pp.
Record type:
Thesis
(Doctoral)
Abstract
Several challenges arise in the application of modern computer vision and machine learning techniques to making inferences from medical images. They include relatively low sample images, biases inherent in the training data that can act as confounding variables, and the need to incorporate clinical variables about patients. In this dissertation, we combine deep neural networks, pre-trained on large datasets of natural images (e.g. ImageNet) acting as feature extractors, and linear discriminant subspaces to address these issues. We start with our extension to the classic linear discriminant analysis (LDA) to derive multiple mutually orthogonal discriminant directions in the multi-class discriminant subspace, which we refer to as generalised optimal LDA (GO-LDA). Unlike previous work in the field of LDA, the discriminability of the subspace we derive is not limited by the number of classes in the multi-class problem. Empirical work on 14 datasets covering 11 different disease domains shows the advantage of this discriminant subspace approach in a few-shot learning setting. Furthermore, bias in data, arising from confounding information from protected characteristics (e.g. gender or skin tone color), which can lead to unacceptable decision barriers in society, is of serious concern in medical inference problems. Here, we show how an approach biased towards deriving orthogonal discriminant directions, whereby one direction separates the protected characteristic and one separates disease state, can effectively address this issue. This is demonstrated using dermatology and chest X-ray problems in which skin tone color and gender induce confounding issues. Finally, we demonstrate that appending relevant clinical variables in the reduced discriminant subspace is effective, as demonstrated using tuberculosis and dermatology datasets.
Text
Thesis
- Version of Record
Text
Final-thesis-submission-Examination-Miss-Jiahui-Liu
Restricted to Repository staff only
More information
Published date: September 2024
Identifiers
Local EPrints ID: 493897
URI: http://eprints.soton.ac.uk/id/eprint/493897
PURE UUID: 8b935fa4-7cb4-40a8-bc48-07758f3be21a
Catalogue record
Date deposited: 17 Sep 2024 16:34
Last modified: 21 Sep 2024 02:04
Export record
Contributors
Author:
Jiahui Liu
Thesis advisor:
Xiaohao Cai
Thesis advisor:
Mahesan Niranjan
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics