Evaluation of dimensionality reduction methods for image auto-annotation

DOI 被引用文献2件 オープンアクセス

説明

Image auto-annotation is a challenging task in computer vision. The goal of this task is to predict multiple words for generic images automatically. Recent state-of-theart methods are based on a non-parametric approach that uses several visual features to calculate distances between image samples. While this approach is successful from the viewpoint of annotation accuracy, the computational costs, in terms of both complexity and memory use, tend to be high, since non-parametric methods require many training instances to be stored in memory to compute distances from a query. In this paper, we investigate several linear dimensionality reduction methods for efficient image annotation. Using the additional information provided by multiple labels, we can obtain a small representation preserving (and hopefully improving) the semantic distance of a visual feature. Linear methods are computationally reasonable and are suitable for practical large-scale systems, although only limited comparison of such methods is available in this research field. Extensive experiments and analyses on various datasets and visual features show how these simple methods can be applied effectively to image annotation.

収録刊行物

被引用文献 (2)*注記

もっと見る

詳細情報 詳細情報について

  • CRID
    1360576284472966912
  • DOI
    10.5244/c.24.94
  • データソース種別
    • Crossref
    • OpenAIRE

問題の指摘

ページトップへ