方策探査法のための多重重点サンプリングを用いた経験再利用

書誌事項

タイトル別名
  • Sample Reuse with Adaptive Multiple Importance Sampling for Policy Search

抄録

<p>In policy search methods, importance sampling is widely used to reutilize samples drawn from previous sampling distributions that are usually different from the current one. Previous studies create uniform mixtures of previous sampling distributions as proposal distribution. To further improve sample efficiency, we introduce adaptive multiple importance sampling that optimizes the mixing coefficients to minimize the variance of the importance sampling estimator. We apply the proposed method to several policy search methods and experimental results on some benchmark control tasks show that all the methods improve sample efficiency.</p>

収録刊行物

詳細情報 詳細情報について

問題の指摘

ページトップへ