The University of Southampton
University of Southampton Institutional Repository

Ensemble algorithms and feature selection

Ensemble algorithms and feature selection
Ensemble algorithms and feature selection

A popular technique for modelling data is to construct an ensemble of learners and combine them in to a single hypothesis.  This final model can achieve an accuracy that is greater than that of the ensemble members, provided that there is a sufficient level of diversity within these learners.  Measuring and promoting this diversity can be achieved in a variety of ways and typically a trade-off exists between the accuracy and diversity of the ensemble members.  This thesis investigates and develops ensemble techniques for improving this accuracy and diversity, and compares them to other well-known ensemble methods.  These algorithms are shown to successfully promote diversity whilst maintaining the learner accuracy.

An important area of machine learning research is that of feature selection.  Choosing an appropriate subset of the available features with which to represent the data can improve the performance of learning algorithms in terms of accuracy, efficiency and interpretability.  However, this task is non-trivial and can be complicated further through interactions amongst the features, which can result in features only being relevant within a local area of the space.  Through the creation of diverse local models, ensemble methods have the capacity to address these issues and identify feature relevance.  This work develops new methods that utilise these aspects of ensemble algorithms to identify and exploit feature information.

University of Southampton
Rogers, Jeremy D
7d3cfba2-3c5f-4888-aa3c-aa0020147fd5
Rogers, Jeremy D
7d3cfba2-3c5f-4888-aa3c-aa0020147fd5

Rogers, Jeremy D (2007) Ensemble algorithms and feature selection. University of Southampton, Doctoral Thesis.

Record type: Thesis (Doctoral)

Abstract

A popular technique for modelling data is to construct an ensemble of learners and combine them in to a single hypothesis.  This final model can achieve an accuracy that is greater than that of the ensemble members, provided that there is a sufficient level of diversity within these learners.  Measuring and promoting this diversity can be achieved in a variety of ways and typically a trade-off exists between the accuracy and diversity of the ensemble members.  This thesis investigates and develops ensemble techniques for improving this accuracy and diversity, and compares them to other well-known ensemble methods.  These algorithms are shown to successfully promote diversity whilst maintaining the learner accuracy.

An important area of machine learning research is that of feature selection.  Choosing an appropriate subset of the available features with which to represent the data can improve the performance of learning algorithms in terms of accuracy, efficiency and interpretability.  However, this task is non-trivial and can be complicated further through interactions amongst the features, which can result in features only being relevant within a local area of the space.  Through the creation of diverse local models, ensemble methods have the capacity to address these issues and identify feature relevance.  This work develops new methods that utilise these aspects of ensemble algorithms to identify and exploit feature information.

Text
1070662.pdf - Version of Record
Available under License University of Southampton Thesis Licence.
Download (2MB)

More information

Published date: 2007

Identifiers

Local EPrints ID: 466231
URI: http://eprints.soton.ac.uk/id/eprint/466231
PURE UUID: c2654dd5-78d7-4377-8272-05a8d7f3882d

Catalogue record

Date deposited: 05 Jul 2022 04:52
Last modified: 16 Mar 2024 20:34

Export record

Contributors

Author: Jeremy D Rogers

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×