The University of Southampton
University of Southampton Institutional Repository

The classification of incomplete vectors

The classification of incomplete vectors
The classification of incomplete vectors

The thesis begins with a brief introduction to Pattern Recognition followed by a broad survey of the missing data problem in other fields. Possible ways to approach the problem of classifying incomplete vectors are considered. The author argues that substituting estimates for missing components usually leads to a sub-optimal classification, that decision surface methods can be applied in some cases, and that probability density function estimation can provide a general solution.A decision surface method is developed which defines the decision surfaces by point sets selected from. the training set. These point sets are selected by an edited condensed nearest neighbour rule devised by the author; the need to store points more than once is avoided by labelling the points.Two types of pdf estimation methods are defined: local and global. The local method leads to classifications obtained directly from the training set, while the global method v yields decisions via substitution in parametric estimators. The author investigates local nonparametric estimators in detail, explaining their advantages and disadvantages fdr the current application. It is argued that series estimators cannot be usefully applied to the general problem.A general model.suitable for the global method is presented, and different specialisations of it are shown. By deliberately restricting the number of comparisons between these specialisation a greater statistical validity is given to the conclusions. The constrained non-linear optimisation methods used in estimating the parameters of the models are explained, as well as initialisation methods and choice of objective functions. Further particular models suitable for the global methods are also shown.

University of Southampton
Hand, David John
Hand, David John

Hand, David John (1977) The classification of incomplete vectors. University of Southampton, Doctoral Thesis.

Record type: Thesis (Doctoral)

Abstract

The thesis begins with a brief introduction to Pattern Recognition followed by a broad survey of the missing data problem in other fields. Possible ways to approach the problem of classifying incomplete vectors are considered. The author argues that substituting estimates for missing components usually leads to a sub-optimal classification, that decision surface methods can be applied in some cases, and that probability density function estimation can provide a general solution.A decision surface method is developed which defines the decision surfaces by point sets selected from. the training set. These point sets are selected by an edited condensed nearest neighbour rule devised by the author; the need to store points more than once is avoided by labelling the points.Two types of pdf estimation methods are defined: local and global. The local method leads to classifications obtained directly from the training set, while the global method v yields decisions via substitution in parametric estimators. The author investigates local nonparametric estimators in detail, explaining their advantages and disadvantages fdr the current application. It is argued that series estimators cannot be usefully applied to the general problem.A general model.suitable for the global method is presented, and different specialisations of it are shown. By deliberately restricting the number of comparisons between these specialisation a greater statistical validity is given to the conclusions. The constrained non-linear optimisation methods used in estimating the parameters of the models are explained, as well as initialisation methods and choice of objective functions. Further particular models suitable for the global methods are also shown.

This record has no associated files available for download.

More information

Published date: 1977

Identifiers

Local EPrints ID: 462685
URI: http://eprints.soton.ac.uk/id/eprint/462685
PURE UUID: 62128143-e6b8-4398-b3c2-ed003cae7a46

Catalogue record

Date deposited: 04 Jul 2022 19:40
Last modified: 04 Jul 2022 19:40

Export record

Contributors

Author: David John Hand

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×