The University of Southampton
University of Southampton Institutional Repository

Automatically Annotating the MIR Flickr Dataset: Experimental Protocols, Openly Available Data and Semantic Spaces

Automatically Annotating the MIR Flickr Dataset: Experimental Protocols, Openly Available Data and Semantic Spaces
Automatically Annotating the MIR Flickr Dataset: Experimental Protocols, Openly Available Data and Semantic Spaces
The availability of a large, freely redistributable set of high-quality annotated images is critical to allowing researchers in the area of automatic annotation, generic object recognition and concept detection to compare results. The recent introduction of the MIR Flickr dataset allows researchers such access. A dataset by itself is not enough, and a set of repeatable guidelines for performing evaluations that are comparable is required. In many cases it also is useful to compare the machine-learning components of different automatic annotation techniques using a common set of image features. This paper seeks to provide a solid, repeatable methodology and protocol for performing evaluations of automatic annotation software using the MIR Flickr dataset together with freely available tools for measuring performance in a controlled manner. This protocol is demonstrated through a set of experiments using a “semantic space” auto-annotator previously developed by the authors, in combination with a set of visual term features for the images that has been made publicly available for download. The paper also discusses how much training data is required to train the semantic space annotator with the MIR Flickr dataset. It is the hope of the authors that researchers will adopt this methodology and produce results from their own annotators that can be directly compared to those presented in this work.
Evaluation, Automatic Annotation, Semantic Image Retrieval, Visual-terms, Semantic spaces, Image Content Analysis
978-1-60558-815-5
547-556
Hare, Jonathan
65ba2cda-eaaf-4767-a325-cd845504e5a9
Lewis, Paul
7aa6c6d9-bc69-4e19-b2ac-a6e20558c020
Hare, Jonathan
65ba2cda-eaaf-4767-a325-cd845504e5a9
Lewis, Paul
7aa6c6d9-bc69-4e19-b2ac-a6e20558c020

Hare, Jonathan and Lewis, Paul (2010) Automatically Annotating the MIR Flickr Dataset: Experimental Protocols, Openly Available Data and Semantic Spaces. MIR '10: Proceedings of the international conference on Multimedia information retrieval, Philadelphia, United States. 29 - 31 Mar 2010. pp. 547-556 .

Record type: Conference or Workshop Item (Paper)

Abstract

The availability of a large, freely redistributable set of high-quality annotated images is critical to allowing researchers in the area of automatic annotation, generic object recognition and concept detection to compare results. The recent introduction of the MIR Flickr dataset allows researchers such access. A dataset by itself is not enough, and a set of repeatable guidelines for performing evaluations that are comparable is required. In many cases it also is useful to compare the machine-learning components of different automatic annotation techniques using a common set of image features. This paper seeks to provide a solid, repeatable methodology and protocol for performing evaluations of automatic annotation software using the MIR Flickr dataset together with freely available tools for measuring performance in a controlled manner. This protocol is demonstrated through a set of experiments using a “semantic space” auto-annotator previously developed by the authors, in combination with a set of visual term features for the images that has been made publicly available for download. The paper also discusses how much training data is required to train the semantic space annotator with the MIR Flickr dataset. It is the hope of the authors that researchers will adopt this methodology and produce results from their own annotators that can be directly compared to those presented in this work.

Text
mirflickr-invited.pdf - Version of Record
Download (1MB)
Text
MIRPoster.pdf - Other
Download (3MB)

More information

Published date: 31 March 2010
Additional Information: Event Dates: 29-31 March 2010
Venue - Dates: MIR '10: Proceedings of the international conference on Multimedia information retrieval, Philadelphia, United States, 2010-03-29 - 2010-03-31
Keywords: Evaluation, Automatic Annotation, Semantic Image Retrieval, Visual-terms, Semantic spaces, Image Content Analysis
Organisations: Web & Internet Science

Identifiers

Local EPrints ID: 268800
URI: http://eprints.soton.ac.uk/id/eprint/268800
ISBN: 978-1-60558-815-5
PURE UUID: 979a437d-7eb7-47c8-8e03-35bfedd81c1d
ORCID for Jonathan Hare: ORCID iD orcid.org/0000-0003-2921-4283

Catalogue record

Date deposited: 01 Apr 2010 14:28
Last modified: 15 Mar 2024 03:25

Export record

Contributors

Author: Jonathan Hare ORCID iD
Author: Paul Lewis

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×