Sample Reuse with Adaptive Multiple Importance Sampling for Policy Search

Bibliographic Information

Other Title
  • 方策探査法のための多重重点サンプリングを用いた経験再利用

Abstract

<p>In policy search methods, importance sampling is widely used to reutilize samples drawn from previous sampling distributions that are usually different from the current one. Previous studies create uniform mixtures of previous sampling distributions as proposal distribution. To further improve sample efficiency, we introduce adaptive multiple importance sampling that optimizes the mixing coefficients to minimize the variance of the importance sampling estimator. We apply the proposed method to several policy search methods and experimental results on some benchmark control tasks show that all the methods improve sample efficiency.</p>

Journal

Details 詳細情報について

Report a problem

Back to top