ボルツマン選択を用いたDeep Q Network

  • 北 悠人
    千葉工業大学大学院情報科学研究科情報科学専攻
  • 山口 智
    千葉工業大学情報科学部情報工学科

書誌事項

タイトル別名
  • A Deep Q Network with Boltzmann Selection
  • ボルツマン センタク オ モチイタ Deep Q Network

この論文をさがす

説明

<p>The reinforcement learning is a method of training for an agent for accomplishing task by selecting suitable action from the current state. Deep Q network is combining convolutional network with Q-learning. By using the Convolutional Neural Network, Deep Q Network can apply to large dimentional input state tasks without special pre-processing. However Deep Q Network needs a large iteration for getting excellent outputs. The reason of that the Deep Q Network is using ε-greedy for action selection, and the ε is set to high value (close to one) in initial stage in learning. High ε value means that the agent selects action randomly in the learning. Hence, the agent needs large number of iteration of learning for accomplishing a task. In this paper adopts the Boltzmann selection to Deep Q Network. Finally, our algorithm has been applied to 2 kinds of arcade learning environment tasks, and results showed that our algorithm is better than ordinary Deep Q Network.</p>

収録刊行物

参考文献 (4)*注記

もっと見る

詳細情報 詳細情報について

問題の指摘

ページトップへ