Q-Learning using Retrospective Kalman Filters

Takao Miura, Kei Takahata

doi:10.1109/iiai-aai50415.2020.00063

【2025年5月12日更新】CiNii Dissertations及びCiNii BooksのCiNii Researchへの統合について
CiNii Researchナレッジグラフ検索機能（試行版）をCiNii Labsにて公開しました
【2025年6月30日更新】日経BP社提供データの更新停止及び削除について
「研究データ」「根拠データ」の収録について

Q-Learning using Retrospective Kalman Filters

DOI

Takao Miura
Kei Takahata

説明

Reinforcement Learning allows us to acquire knowledge without any training data. However, for learning it takes time. We discuss a case in which an agent receives a large negative reward. We assume that the reverse action allows us to improve the current situation. In this work, we propose a method to perform Reverse action by using Retrospective Kalman Filter that estimates the state one step before. We show an experience by a Hunter Prey problem. And discuss the usefulness of our proposed method.

収録刊行物

2020 9th International Congress on Advanced Applied Informatics (IIAI-AAI)

2020 9th International Congress on Advanced Applied Informatics (IIAI-AAI) 284-289, 2020-09-01

IEEE

詳細情報詳細情報について

CRID

1870865117545709440
DOI

10.1109/iiai-aai50415.2020.00063
データソース種別
- OpenAIRE

書き出し

RefWorksに書き出し
EndNoteに書き出し
Mendeleyに書き出し
RDFで書き出し
Refer/BibIXで表示
RISで表示
BibTeXで表示
TSVで表示
CSVで表示
JSON-LDで表示

問題の指摘

ページトップへ

Q-Learning using Retrospective Kalman Filters

説明

収録刊行物

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について