Saliency for Image Description and Retrieval

Hare, Jonathon S. (2006) Saliency for Image Description and Retrieval. University of Southampton, School of Electronics and Computer Science, Doctoral Thesis.

Record type: Thesis (Doctoral)

Abstract

We live in a world where we are surrounded by ever increasing numbers of images. More often than not, these images have very little metadata by which they can be indexed and searched. In order to avoid information overload, techniques need to be developed to enable these image collections to be searched by their content. Much of the previous work on image retrieval has used global features such as colour and texture to describe the content of the image. However, these global features are insufficient to accurately describe the image content when different parts of the image have different characteristics. This thesis initially discusses how this problem can be circumvented by using salient interest regions to select the areas of the image that are most interesting and generate local descriptors to describe the image characteristics in that region. The thesis discusses a number of different saliency detectors that are suitable for robust retrieval purposes and performs a comparison between a number of these region detectors. The thesis then discusses how salient regions can be used for image retrieval using a number of techniques, but most importantly, two techniques inspired from the field of textual information retrieval. Using these robust retrieval techniques, a new paradigm in image retrieval is discussed, whereby the retrieval takes place on a mobile device using a query image captured by a built-in camera. This paradigm is demonstrated in the context of an art gallery, in which the device can be used to find more information about particular images. The final chapter of the thesis discusses some approaches to bridging the semantic gap in image retrieval. The chapter explores ways in which un-annotated image collections can be searched by keyword. Two techniques are discussed; the first explicitly attempts to automatically annotate the un-annotated images so that the automatically applied annotations can be used for searching. The second approach does not try to explicitly annotate images, but rather, through the use of linear algebra, it attempts to create a semantic space in which images and keywords are positioned such that images are close to the keywords that represent them within the space.

Text

Thesis.pdf - Other

Download (8MB)

More information

Published date: April 2006

Keywords: content-based retrieval, sementic space, salient regions

Organisations: University of Southampton, Web & Internet Science

Identifiers

Local EPrints ID: 262437

URI: http://eprints.soton.ac.uk/id/eprint/262437

PURE UUID: 50f9c6a0-9fbd-4300-a9c1-78f90955ba24

ORCID for Jonathon S. Hare:

orcid.org/0000-0003-2921-4283

Catalogue record

Date deposited: 30 Apr 2006

Last modified: 15 Mar 2024 03:25

Export record

Share this record

Share this on Facebook Share this on Twitter Share this on Weibo

Contributors

Author: Jonathon S. Hare

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Library staff additional information