[Invited Paper] Semantic Indexing for Large-Scale Video Retrieval

Inoue Nakamasa, Shinoda Koichi

doi:10.3169/mta.4.209

説明

Video semantic indexing, which aims to detect objects, actions and scenes from video data, is one of important research topics in multimedia information processing. In the Text Retrieval Conference Video Retrieval Evaluation (TRECVID) workshop, many fundamental techniques for video processing have been developed and have been shown to be effective for real data such as Internet videos. They include extensions of deep learning techniques and image recognition techniques such as bag of visual words to video data. This paper reviews TRECVID activities with these techniques for semantic indexing. We also show the TokyoTech system using Gaussian-mixture-model (GMM) supervectors and deep convolutional neural networks (CNNs) with its experimental evaluation at TRECVID 2014.

収録刊行物

映像情報メディア学会英語論文誌

映像情報メディア学会英語論文誌 4 (3), 209-217, 2016

一般社団法人映像情報メディア学会

キーワード

詳細情報詳細情報について

CRID: 1390282680401537024

NII論文ID: 130005161897

DOI: 10.3169/mta.4.209

ISSN: 21867364

Web Site: https://www.jstage.jst.go.jp/article/mta/4/3/4_209/_pdf

本文言語コード: en

資料種別: journal article

データソース種別

JaLC
Crossref
CiNii Articles
KAKEN
OpenAIRE

抄録ライセンスフラグ: 使用不可

書き出し

問題の指摘

[Invited Paper] Semantic Indexing for Large-Scale Video Retrieval

書誌事項

説明

収録刊行物

参考文献 (73)*注記

関連プロジェクト

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

[Invited Paper] Semantic Indexing for Large-Scale Video Retrieval

書誌事項

説明

収録刊行物

参考文献 (73)*注記

関連プロジェクト

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について