-
- Inoue Nakamasa
- Tokyo Institute of Technology
-
- Shinoda Koichi
- Tokyo Institute of Technology
書誌事項
- 公開日
- 2016
- 資源種別
- journal article
- DOI
-
- 10.3169/mta.4.209
- 公開者
- 一般社団法人 映像情報メディア学会
説明
Video semantic indexing, which aims to detect objects, actions and scenes from video data, is one of important research topics in multimedia information processing. In the Text Retrieval Conference Video Retrieval Evaluation (TRECVID) workshop, many fundamental techniques for video processing have been developed and have been shown to be effective for real data such as Internet videos. They include extensions of deep learning techniques and image recognition techniques such as bag of visual words to video data. This paper reviews TRECVID activities with these techniques for semantic indexing. We also show the TokyoTech system using Gaussian-mixture-model (GMM) supervectors and deep convolutional neural networks (CNNs) with its experimental evaluation at TRECVID 2014.
収録刊行物
-
- 映像情報メディア学会英語論文誌
-
映像情報メディア学会英語論文誌 4 (3), 209-217, 2016
一般社団法人 映像情報メディア学会
- Tweet
キーワード
詳細情報 詳細情報について
-
- CRID
- 1390282680401537024
-
- NII論文ID
- 130005161897
-
- ISSN
- 21867364
-
- 本文言語コード
- en
-
- 資料種別
- journal article
-
- データソース種別
-
- JaLC
- Crossref
- CiNii Articles
- KAKEN
- OpenAIRE
-
- 抄録ライセンスフラグ
- 使用不可