The University of Southampton
University of Southampton Institutional Repository

Content based image retrieval : analogies with text

Content based image retrieval : analogies with text
Content based image retrieval : analogies with text

This thesis describes research into an area of content based image retrieval (CBIR), that of feature indexing for the purpose of rapid retrieval. The techniques in this thesis draw from the field of text IR and demonstrate that individual image extraction algorithms can be optimised for use with an inverted index, which could lead to CBIR systems capable of sub-second retrieval times on collections of millions of images.

A novel global feature algorithm, QMNS, is presented, which is capable of capturing both colour and texture information in a spatially insensitive manner. Images are divided into regular patches from which dominant colour modes are derived using the mean shift algorithm. The RGB bi-modal colour space is quantised giving a set of labelled feature terms with associated frequencies and the terms inserted into an inverted index. Terms in the index are retrieved with a TF*IDF algorithm.

The performance of QMNS and the index is measured by comparison with an RGB colour histogram, an RGB CCV histogram, and the unquantised MNS features. Precision and recall results show that the indexed feature performs equally as well as the other algorithms for an image collection of photographic images. An analysis of the distribution of each type of feature term was performed, showing that Zipf 's law holds in each case. Quantisation parameters for the algorithms were varied, demonstrating that a tradeoff exists between vocabulary size, the average precision, and the speed of retrieval.

This thesis indicates that the current generation of highly successfully text IR systems, which are implemented using inverted indexes, could provide the basis for very rapid image and multimedia retrieval. The optimisation techniques shown can be used for any quantisable feature, and allow retrieval to be performed without specialised comparison algorithms.

University of Southampton
Westmacott, Mike
9b2ea97b-238c-4043-ae7e-ce221abfd3b4
Westmacott, Mike
9b2ea97b-238c-4043-ae7e-ce221abfd3b4

Westmacott, Mike (2005) Content based image retrieval : analogies with text. University of Southampton, Doctoral Thesis.

Record type: Thesis (Doctoral)

Abstract

This thesis describes research into an area of content based image retrieval (CBIR), that of feature indexing for the purpose of rapid retrieval. The techniques in this thesis draw from the field of text IR and demonstrate that individual image extraction algorithms can be optimised for use with an inverted index, which could lead to CBIR systems capable of sub-second retrieval times on collections of millions of images.

A novel global feature algorithm, QMNS, is presented, which is capable of capturing both colour and texture information in a spatially insensitive manner. Images are divided into regular patches from which dominant colour modes are derived using the mean shift algorithm. The RGB bi-modal colour space is quantised giving a set of labelled feature terms with associated frequencies and the terms inserted into an inverted index. Terms in the index are retrieved with a TF*IDF algorithm.

The performance of QMNS and the index is measured by comparison with an RGB colour histogram, an RGB CCV histogram, and the unquantised MNS features. Precision and recall results show that the indexed feature performs equally as well as the other algorithms for an image collection of photographic images. An analysis of the distribution of each type of feature term was performed, showing that Zipf 's law holds in each case. Quantisation parameters for the algorithms were varied, demonstrating that a tradeoff exists between vocabulary size, the average precision, and the speed of retrieval.

This thesis indicates that the current generation of highly successfully text IR systems, which are implemented using inverted indexes, could provide the basis for very rapid image and multimedia retrieval. The optimisation techniques shown can be used for any quantisable feature, and allow retrieval to be performed without specialised comparison algorithms.

Text
1003833.pdf - Version of Record
Available under License University of Southampton Thesis Licence.
Download (8MB)

More information

Published date: 2005

Identifiers

Local EPrints ID: 465826
URI: http://eprints.soton.ac.uk/id/eprint/465826
PURE UUID: 849265d4-1e2a-4ac5-8065-2e2b331f74bb

Catalogue record

Date deposited: 05 Jul 2022 03:13
Last modified: 16 Mar 2024 20:23

Export record

Contributors

Author: Mike Westmacott

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×