Learning tag relevance by neighbor voting for social image retrieval

Authors
Publication date 2008
Book title Proceedings of the 1st ACM International Conference on Multimedia Information Retrieval (MIR 2008)
ISBN
  • 9781605583129
Event 1st ACM International Conference on Multimedia Information Retrieval (MIR 2008), Vancouver, Canada
Pages (from-to) 180-187
Publisher New York, NY: Association for Computing Machinery (ACM)
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Social image retrieval is important for exploiting the increasing amounts of amateur-tagged multimedia such as Flickr images. Since amateur tagging is known to be uncontrolled, ambiguous, and personalized, a fundamental problem is how to reliably interpret the relevance of a tag with respect to the visual content it is describing. Intuitively, if different persons label similar images using the same tags, these tags are likely to reflect objective aspects of the visual content. Starting from this intuition, we propose a novel algorithm that scalably and reliably learns tag relevance by accumulating votes from visually similar neighbors. Further, treated as tag frequency, learned tag relevance is seamlessly embedded into current tag-based social image retrieval paradigms.
Preliminary experiments on one million Flickr images demonstrate the potential of the proposed algorithm. Overall comparisons for both single-word queries and multiple-word queries show substantial improvement over the baseline by learning and using tag relevance. Specifically, compared with the baseline using the original tags, on average, retrieval using improved tags increases mean average precision by 24%, from 0.54 to 0.67. Moreover, simulated experiments indicate that performance can be improved further by scaling up the amount of images used in the proposed neighbor voting algorithm.
Document type Conference contribution
Published at http://doi.acm.org/10.1145/1460096.1460126
Permalink to this page
Back