Automatically Annotating the MIR Flickr Dataset: Experimental Protocols, Openly Available Data and Semantic Spaces
Automatically Annotating the MIR Flickr Dataset: Experimental Protocols, Openly Available Data and Semantic Spaces
The availability of a large, freely redistributable set of high-quality annotated images is critical to allowing researchers in the area of automatic annotation, generic object recognition and concept detection to compare results. The recent introduction of the MIR Flickr dataset allows researchers such access. A dataset by itself is not enough, and a set of repeatable guidelines for performing evaluations that are comparable is required. In many cases it also is useful to compare the machine-learning components of different automatic annotation techniques using a common set of image features. This paper seeks to provide a solid, repeatable methodology and protocol for performing evaluations of automatic annotation software using the MIR Flickr dataset together with freely available tools for measuring performance in a controlled manner. This protocol is demonstrated through a set of experiments using a “semantic space” auto-annotator previously developed by the authors, in combination with a set of visual term features for the images that has been made publicly available for download. The paper also discusses how much training data is required to train the semantic space annotator with the MIR Flickr dataset. It is the hope of the authors that researchers will adopt this methodology and produce results from their own annotators that can be directly compared to those presented in this work.
Evaluation, Automatic Annotation, Semantic Image Retrieval, Visual-terms, Semantic spaces, Image Content Analysis
978-1-60558-815-5
547-556
Hare, Jonathan
65ba2cda-eaaf-4767-a325-cd845504e5a9
Lewis, Paul
7aa6c6d9-bc69-4e19-b2ac-a6e20558c020
31 March 2010
Hare, Jonathan
65ba2cda-eaaf-4767-a325-cd845504e5a9
Lewis, Paul
7aa6c6d9-bc69-4e19-b2ac-a6e20558c020
Hare, Jonathan and Lewis, Paul
(2010)
Automatically Annotating the MIR Flickr Dataset: Experimental Protocols, Openly Available Data and Semantic Spaces.
MIR '10: Proceedings of the international conference on Multimedia information retrieval, Philadelphia, United States.
29 - 31 Mar 2010.
.
Record type:
Conference or Workshop Item
(Paper)
Abstract
The availability of a large, freely redistributable set of high-quality annotated images is critical to allowing researchers in the area of automatic annotation, generic object recognition and concept detection to compare results. The recent introduction of the MIR Flickr dataset allows researchers such access. A dataset by itself is not enough, and a set of repeatable guidelines for performing evaluations that are comparable is required. In many cases it also is useful to compare the machine-learning components of different automatic annotation techniques using a common set of image features. This paper seeks to provide a solid, repeatable methodology and protocol for performing evaluations of automatic annotation software using the MIR Flickr dataset together with freely available tools for measuring performance in a controlled manner. This protocol is demonstrated through a set of experiments using a “semantic space” auto-annotator previously developed by the authors, in combination with a set of visual term features for the images that has been made publicly available for download. The paper also discusses how much training data is required to train the semantic space annotator with the MIR Flickr dataset. It is the hope of the authors that researchers will adopt this methodology and produce results from their own annotators that can be directly compared to those presented in this work.
Text
mirflickr-invited.pdf
- Version of Record
Text
MIRPoster.pdf
- Other
More information
Published date: 31 March 2010
Additional Information:
Event Dates: 29-31 March 2010
Venue - Dates:
MIR '10: Proceedings of the international conference on Multimedia information retrieval, Philadelphia, United States, 2010-03-29 - 2010-03-31
Keywords:
Evaluation, Automatic Annotation, Semantic Image Retrieval, Visual-terms, Semantic spaces, Image Content Analysis
Organisations:
Web & Internet Science
Identifiers
Local EPrints ID: 268800
URI: http://eprints.soton.ac.uk/id/eprint/268800
ISBN: 978-1-60558-815-5
PURE UUID: 979a437d-7eb7-47c8-8e03-35bfedd81c1d
Catalogue record
Date deposited: 01 Apr 2010 14:28
Last modified: 15 Mar 2024 03:25
Export record
Contributors
Author:
Jonathan Hare
Author:
Paul Lewis
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics