The University of Southampton
University of Southampton Institutional Repository

A Review of Codebook Models in Patch-Based Visual Object Recognition

A Review of Codebook Models in Patch-Based Visual Object Recognition
A Review of Codebook Models in Patch-Based Visual Object Recognition
The codebook model-based approach, while ignoring any structural aspect in vision, nonetheless provides state-of-the-art performances on current datasets. The key role of a visual codebook is to provide a way to map the low-level features into a fixed-length vector in histogram space to which standard classifiers can be directly applied. The discriminative power of such a visual codebook determines the quality of the codebook model, whereas the size of the codebook controls the complexity of the model. Thus, the construction of a codebook is an important step which is usually done by cluster analysis. However, clustering is a process that retains regions of high density in a distribution and it follows that the resulting codebook need not have discriminant properties. This is also recognised as a computational bottleneck of such systems. In our recent work, we proposed a resource-allocating codebook, to constructing a discriminant codebook in a one-pass design procedure that slightly outperforms more traditional approaches at drastically reduced computing times. In this review we survey several approaches that have been proposed over the last decade with their use of feature detectors, descriptors, codebook construction schemes, choice of classifiers in recognising objects, and datasets that were used in evaluating the proposed methods.
Ramanan, Amirthalingam
4b287910-5234-42ef-83f0-d9875c319a56
Niranjan, Mahesan
5cbaeea8-7288-4b55-a89c-c43d212ddd4f
Ramanan, Amirthalingam
4b287910-5234-42ef-83f0-d9875c319a56
Niranjan, Mahesan
5cbaeea8-7288-4b55-a89c-c43d212ddd4f

Ramanan, Amirthalingam and Niranjan, Mahesan (2011) A Review of Codebook Models in Patch-Based Visual Object Recognition. JOURNAL OF SIGNAL PROCESSING SYSTEMS, DOI: 10.1007/s11265-011-0622-x.

Record type: Article

Abstract

The codebook model-based approach, while ignoring any structural aspect in vision, nonetheless provides state-of-the-art performances on current datasets. The key role of a visual codebook is to provide a way to map the low-level features into a fixed-length vector in histogram space to which standard classifiers can be directly applied. The discriminative power of such a visual codebook determines the quality of the codebook model, whereas the size of the codebook controls the complexity of the model. Thus, the construction of a codebook is an important step which is usually done by cluster analysis. However, clustering is a process that retains regions of high density in a distribution and it follows that the resulting codebook need not have discriminant properties. This is also recognised as a computational bottleneck of such systems. In our recent work, we proposed a resource-allocating codebook, to constructing a discriminant codebook in a one-pass design procedure that slightly outperforms more traditional approaches at drastically reduced computing times. In this review we survey several approaches that have been proposed over the last decade with their use of feature detectors, descriptors, codebook construction schemes, choice of classifiers in recognising objects, and datasets that were used in evaluating the proposed methods.

Text
RamananJSPS2011.pdf - Accepted Manuscript
Download (849kB)

More information

Published date: September 2011
Organisations: Southampton Wireless Group

Identifiers

Local EPrints ID: 272867
URI: http://eprints.soton.ac.uk/id/eprint/272867
PURE UUID: f63248b4-4cd2-4523-908c-8cbd7539aab8
ORCID for Mahesan Niranjan: ORCID iD orcid.org/0000-0001-7021-140X

Catalogue record

Date deposited: 28 Sep 2011 08:37
Last modified: 15 Mar 2024 03:29

Export record

Contributors

Author: Amirthalingam Ramanan
Author: Mahesan Niranjan ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×