Evaluation of dimensionality reduction methods for image auto-annotation

Hideki Nakayama, Tatsuya Harada, Yasuo Kuniyoshi

doi:10.5244/c.24.94

Image auto-annotation is a challenging task in computer vision. The goal of this task is to predict multiple words for generic images automatically. Recent state-of-theart methods are based on a non-parametric approach that uses several visual features to calculate distances between image samples. While this approach is successful from the viewpoint of annotation accuracy, the computational costs, in terms of both complexity and memory use, tend to be high, since non-parametric methods require many training instances to be stored in memory to compute distances from a query. In this paper, we investigate several linear dimensionality reduction methods for efficient image annotation. Using the additional information provided by multiple labels, we can obtain a small representation preserving (and hopefully improving) the semantic distance of a visual feature. Linear methods are computationally reasonable and are suitable for practical large-scale systems, although only limited comparison of such methods is available in this research field. Extensive experiments and analyses on various datasets and visual features show how these simple methods can be applied effectively to image annotation.

Evaluation of dimensionality reduction methods for image auto-annotation

説明

収録刊行物

被引用文献 (2)*注記

詳細情報詳細情報について

書き出し

問題の指摘

Evaluation of dimensionality reduction methods for image auto-annotation

説明

収録刊行物

被引用文献 (2)*注記

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について