Unsupervised Segmentation for Video Using Convolutional VAE and Gaussian Process
-
- NAGANO Masatoshi
- The University of Electro-Communications
-
- NAKAMURA Tomoaki
- The University of Electro-Communications
-
- NAGAI Takayuki
- The University of Electro-Communications Osaka University
-
- MOCHIHASHI Daichi
- Institute of Statistical Mathematics
-
- KOBAYASHI Ichiro
- Ochanomizu University
-
- TAKANO Wataru
- Osaka University
Bibliographic Information
- Other Title
-
- 畳み込み変分オートエンコーダとガウス過程に基づく動画像の分節化
Description
<p>Humans recognize perceived continuous high-dimensional information by dividing it into significant segments such as words and unit motions. We believe that such unsupervised segmentation is also an important ability for robots to learn topics such as language and motions. To this end, we have been proposed the Hierarchical Dirichlet Processes-Variational Autoencoder-Gaussian Process-Hidden Semi-Markov Model (HVGH) which is composed of a deep generative model and a statistical model. HVGH can extract features from high-dimensional time-series data by VAE while simultaneously dividing it into segments by Gaussian process. In this paper, we propose a method that can segment not only high-dimensional time-series data but also videos in an unsupervised manner by improving VAE of HVGH to Convolutional VAE. In an experiment, we used a first-person view video of an agent in the maze to demonstrate that our proposed model estimates more accurate segments than the baseline method.</p>
Journal
-
- Proceedings of the Annual Conference of JSAI
-
Proceedings of the Annual Conference of JSAI JSAI2021 (0), 2J3GS8b01-2J3GS8b01, 2021
The Japanese Society for Artificial Intelligence
- Tweet
Details 詳細情報について
-
- CRID
- 1390288370500613760
-
- NII Article ID
- 130008051749
-
- ISSN
- 27587347
-
- Text Lang
- ja
-
- Data Source
-
- JaLC
- CiNii Articles
-
- Abstract License Flag
- Disallowed