- 【Updated on May 12, 2025】 Integration of CiNii Dissertations and CiNii Books into CiNii Research
- Trial version of CiNii Research Automatic Translation feature is available on CiNii Labs
- Suspension and deletion of data provided by Nikkei BP
- Regarding the recording of “Research Data” and “Evidence Data”
Apprenticeship Learning for Model Parameters of Partially Observable Environments
Bibliographic Information
- Other Title
-
- 部分観測環境のモデルパラメータに対する徒弟学習
- ブブン カンソク カンキョウ ノ モデルパラメータ ニ タイスル トテイ ガクシュウ
- 情報論的学習理論と機械学習
- ジョウホウロンテキ ガクシュウ リロン ト キカイ ガクシュウ
Search this article
Description
We consider apprenticeship learning, i.e., having an agent learn a task by observing an expert demonstrating the task in a partially observable environment when the model of the environment is uncertain. This setting is useful in applications where the explicit modeling of the environment is difficult, such as a dialogue system. We show that we can extract information about the environment model by inferring action selection process behind the demonstration, under the assumption that the expert is choosing optimal actions based on knowledge of the true model of the target environment. Proposed algorithms can achieve more accurate estimates of POMDP parameters and better policies from a short demonstration, compared to methods that learns only from the reaction from the environment.
Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)
Journal
-
- 電子情報通信学会技術研究報告 = IEICE technical report : 信学技報
-
電子情報通信学会技術研究報告 = IEICE technical report : 信学技報 111 (480), 49-54, 2012-03
東京 : 電子情報通信学会
- Tweet
Keywords
Details 詳細情報について
-
- CRID
- 1520009408089650944
-
- NII Article ID
- 110009545975
-
- NII Book ID
- AA12482480
-
- ISSN
- 09135685
-
- NDL BIB ID
- 023569927
-
- Text Lang
- en
-
- Article Type
- journal article
-
- NDL Source Classification
-
- ZN33(科学技術--電気工学・電気機械工業--電子工学・電気通信)
-
- Data Source
-
- NDL Search
- CiNii Articles
- KAKEN
- OpenAIRE