書誌事項
- タイトル別名
-
- Sampling Policy that Improves Performance of Policy in Reinforcement Learning
- キョウカ ガクシュウ ニ オケル ホウサク ノ セイノウ オ コウジョウ スル サンプリング ホウサク
この論文をさがす
抄録
<p>When applying a reinforcement learning method, the estimation accuracy of the state transition probabilities affects the performance of the policy obtained from the estimated plant. Therefore, we find a sampling condition guaranteeing that the optimal policy from the estimated plant is also optimal for the real plant with the desired degree of reliability, and a sampling methods based on it is proposed. Not by the sampling for the reliability in which the policy is optimal for the real plant, but by the sampling for the policy to be effective irrespective of estimation errors, we can further reduce the number of samples. We show the problem setting for finding the policy which is guaranteed to be effective for estimation errors from the real transition probabilities with the desired degree of reliability, and we propose a sampling method as a solution of this problem. The effectiveness of the proposed method is verified by numerical simulations.</p>
収録刊行物
-
- 計測自動制御学会論文集
-
計測自動制御学会論文集 54 (3), 365-372, 2018
公益社団法人 計測自動制御学会
- Tweet
詳細情報 詳細情報について
-
- CRID
- 1390282679486391808
-
- NII論文ID
- 130006512912
-
- NII書誌ID
- AN00072392
-
- ISSN
- 18838189
- 04534654
-
- NDL書誌ID
- 028916196
-
- 本文言語コード
- ja
-
- データソース種別
-
- JaLC
- NDL
- Crossref
- CiNii Articles
- KAKEN
-
- 抄録ライセンスフラグ
- 使用不可