Ensemble algorithms and feature selection
Ensemble algorithms and feature selection
A popular technique for modelling data is to construct an ensemble of learners and combine them in to a single hypothesis. This final model can achieve an accuracy that is greater than that of the ensemble members, provided that there is a sufficient level of diversity within these learners. Measuring and promoting this diversity can be achieved in a variety of ways and typically a trade-off exists between the accuracy and diversity of the ensemble members. This thesis investigates and develops ensemble techniques for improving this accuracy and diversity, and compares them to other well-known ensemble methods. These algorithms are shown to successfully promote diversity whilst maintaining the learner accuracy.
An important area of machine learning research is that of feature selection. Choosing an appropriate subset of the available features with which to represent the data can improve the performance of learning algorithms in terms of accuracy, efficiency and interpretability. However, this task is non-trivial and can be complicated further through interactions amongst the features, which can result in features only being relevant within a local area of the space. Through the creation of diverse local models, ensemble methods have the capacity to address these issues and identify feature relevance. This work develops new methods that utilise these aspects of ensemble algorithms to identify and exploit feature information.
University of Southampton
Rogers, Jeremy D
7d3cfba2-3c5f-4888-aa3c-aa0020147fd5
2007
Rogers, Jeremy D
7d3cfba2-3c5f-4888-aa3c-aa0020147fd5
Rogers, Jeremy D
(2007)
Ensemble algorithms and feature selection.
University of Southampton, Doctoral Thesis.
Record type:
Thesis
(Doctoral)
Abstract
A popular technique for modelling data is to construct an ensemble of learners and combine them in to a single hypothesis. This final model can achieve an accuracy that is greater than that of the ensemble members, provided that there is a sufficient level of diversity within these learners. Measuring and promoting this diversity can be achieved in a variety of ways and typically a trade-off exists between the accuracy and diversity of the ensemble members. This thesis investigates and develops ensemble techniques for improving this accuracy and diversity, and compares them to other well-known ensemble methods. These algorithms are shown to successfully promote diversity whilst maintaining the learner accuracy.
An important area of machine learning research is that of feature selection. Choosing an appropriate subset of the available features with which to represent the data can improve the performance of learning algorithms in terms of accuracy, efficiency and interpretability. However, this task is non-trivial and can be complicated further through interactions amongst the features, which can result in features only being relevant within a local area of the space. Through the creation of diverse local models, ensemble methods have the capacity to address these issues and identify feature relevance. This work develops new methods that utilise these aspects of ensemble algorithms to identify and exploit feature information.
Text
1070662.pdf
- Version of Record
More information
Published date: 2007
Identifiers
Local EPrints ID: 466231
URI: http://eprints.soton.ac.uk/id/eprint/466231
PURE UUID: c2654dd5-78d7-4377-8272-05a8d7f3882d
Catalogue record
Date deposited: 05 Jul 2022 04:52
Last modified: 16 Mar 2024 20:34
Export record
Contributors
Author:
Jeremy D Rogers
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics