Semantic Models for Machine Learning

In this thesis we present approaches to the creation and usage of semantic models by the analysis of the data spread in the feature space. We aim to introduce the general notion of using feature selection techniques in machine learning applications. The applied approaches obtain new feature directions on data, such that machine learning applications would show an increase in performance. We review three principle methods that are used throughout the thesis. Firstly Canonical Correlation Analysis (CCA), which is a method of correlating linear relationships between two multidimensional variables. CCA can be seen as using complex labels as a way of guiding feature selection towards the underlying semantics. CCA makes use of two views of the same semantic object to extract a representation of the semantics. Secondly Partial Least Squares (PLS), a method similar to CCA. It selects feature directions that are useful for the task at hand, though PLS only uses one view of an object and the label as the corresponding pair. PLS could be thought of as a method that looks for directions that are good for distinguishing the different labels. The third method is the Fisher kernel. A method that aims to extract more information of a generative model than simply by their output probabilities. The aim is to analyse how the Fisher score depends on the model and which aspects of the model are important in determining the Fisher score. We focus our theoretical investigation primarily on CCA and its kernel variant. Providing a theoretical analysis of the method's stability using Rademacher complexity, hence deriving the error bound for new data. We conclude the thesis by applying the described approaches to problems in the various fields of image, text, music application and medical analysis, describing several novel applications on relevant real-world data. The aim of the thesis is to provide a theoretical understanding of semantic models, while also providing a good application foundation on how these models can be practically used.

Kernel Methods, SVM, Neural Networks, Bayesian Networks, Fisher Kernels, CCA, KCCA, PLS, KPLS, Gram-Schmidt, fMRI, Content-Based Retrieval, Music Score Generation, Composer and Performer Identification, Medical Analysis

Hardoon, David R

ba75cf10-43f2-4701-b648-3bb90edbabbf

February 2006

Hardoon, David R

ba75cf10-43f2-4701-b648-3bb90edbabbf

Hardoon, David R (2006) Semantic Models for Machine Learning. University of Southampton, School of Electornics and Computer Science, Doctoral Thesis.

Record type: Thesis (Doctoral)

Abstract

Text

Hardoon_Thesis.pdf - Other

Download (5MB)

More information

Published date: February 2006

Keywords: Kernel Methods, SVM, Neural Networks, Bayesian Networks, Fisher Kernels, CCA, KCCA, PLS, KPLS, Gram-Schmidt, fMRI, Content-Based Retrieval, Music Score Generation, Composer and Performer Identification, Medical Analysis

Organisations: University of Southampton, Electronics & Computer Science

Learn more about the Electronics & Computer Science

Identifiers

Local EPrints ID: 262019

URI: http://eprints.soton.ac.uk/id/eprint/262019

PURE UUID: f967313b-bf4a-48a3-933c-dbb714203a92

Catalogue record

Date deposited: 22 Feb 2006

Last modified: 14 Mar 2024 07:03

Export record

Share this record

Share this on Facebook Share this on Twitter Share this on Weibo

Contributors

Author: David R Hardoon

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Library staff additional information