Content based image retrieval : analogies with text
Content based image retrieval : analogies with text
This thesis describes research into an area of content based image retrieval (CBIR), that of feature indexing for the purpose of rapid retrieval. The techniques in this thesis draw from the field of text IR and demonstrate that individual image extraction algorithms can be optimised for use with an inverted index, which could lead to CBIR systems capable of sub-second retrieval times on collections of millions of images.
A novel global feature algorithm, QMNS, is presented, which is capable of capturing both colour and texture information in a spatially insensitive manner. Images are divided into regular patches from which dominant colour modes are derived using the mean shift algorithm. The RGB bi-modal colour space is quantised giving a set of labelled feature terms with associated frequencies and the terms inserted into an inverted index. Terms in the index are retrieved with a TF*IDF algorithm.
The performance of QMNS and the index is measured by comparison with an RGB colour histogram, an RGB CCV histogram, and the unquantised MNS features. Precision and recall results show that the indexed feature performs equally as well as the other algorithms for an image collection of photographic images. An analysis of the distribution of each type of feature term was performed, showing that Zipf 's law holds in each case. Quantisation parameters for the algorithms were varied, demonstrating that a tradeoff exists between vocabulary size, the average precision, and the speed of retrieval.
This thesis indicates that the current generation of highly successfully text IR systems, which are implemented using inverted indexes, could provide the basis for very rapid image and multimedia retrieval. The optimisation techniques shown can be used for any quantisable feature, and allow retrieval to be performed without specialised comparison algorithms.
University of Southampton
Westmacott, Mike
9b2ea97b-238c-4043-ae7e-ce221abfd3b4
2005
Westmacott, Mike
9b2ea97b-238c-4043-ae7e-ce221abfd3b4
Westmacott, Mike
(2005)
Content based image retrieval : analogies with text.
University of Southampton, Doctoral Thesis.
Record type:
Thesis
(Doctoral)
Abstract
This thesis describes research into an area of content based image retrieval (CBIR), that of feature indexing for the purpose of rapid retrieval. The techniques in this thesis draw from the field of text IR and demonstrate that individual image extraction algorithms can be optimised for use with an inverted index, which could lead to CBIR systems capable of sub-second retrieval times on collections of millions of images.
A novel global feature algorithm, QMNS, is presented, which is capable of capturing both colour and texture information in a spatially insensitive manner. Images are divided into regular patches from which dominant colour modes are derived using the mean shift algorithm. The RGB bi-modal colour space is quantised giving a set of labelled feature terms with associated frequencies and the terms inserted into an inverted index. Terms in the index are retrieved with a TF*IDF algorithm.
The performance of QMNS and the index is measured by comparison with an RGB colour histogram, an RGB CCV histogram, and the unquantised MNS features. Precision and recall results show that the indexed feature performs equally as well as the other algorithms for an image collection of photographic images. An analysis of the distribution of each type of feature term was performed, showing that Zipf 's law holds in each case. Quantisation parameters for the algorithms were varied, demonstrating that a tradeoff exists between vocabulary size, the average precision, and the speed of retrieval.
This thesis indicates that the current generation of highly successfully text IR systems, which are implemented using inverted indexes, could provide the basis for very rapid image and multimedia retrieval. The optimisation techniques shown can be used for any quantisable feature, and allow retrieval to be performed without specialised comparison algorithms.
Text
1003833.pdf
- Version of Record
More information
Published date: 2005
Identifiers
Local EPrints ID: 465826
URI: http://eprints.soton.ac.uk/id/eprint/465826
PURE UUID: 849265d4-1e2a-4ac5-8065-2e2b331f74bb
Catalogue record
Date deposited: 05 Jul 2022 03:13
Last modified: 16 Mar 2024 20:23
Export record
Contributors
Author:
Mike Westmacott
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics