書誌事項
- タイトル別名
-
- Span Seminorm Approach to Controlled Markov Set-Chains
この論文をさがす
説明
type:text
In a controlled Markov set-chain with finite state and action spaces, we find a policy, called average-optimal, which maximizes Cesaro sums of each time's reward over all stationaly policies under some partial order. Under uniformly scrambling conditions, the dynamic programming operator for our model is proved to be a contraction in a span seminorm. And, analysing the behavior of expected total rewards over the T-horizon as T approaches ∞ by a fixed point of a span-contraction operator we give a constructive proof for the existence of an average-optimal policy.
source:Bulletin of the Faculty of Education, Chiba University. III, Natural sciences
収録刊行物
-
- 千葉大学教育学部研究紀要. III, 自然科学編
-
千葉大学教育学部研究紀要. III, 自然科学編 46 13-23, 1998-02-28
千葉大学教育学部
- Tweet
キーワード
詳細情報 詳細情報について
-
- CRID
- 1050007072213340672
-
- NII論文ID
- 110004624632
-
- NII書誌ID
- AN10494753
-
- ISSN
- 13427423
-
- NDL書誌ID
- 4654678
-
- 本文言語コード
- en
-
- 資料種別
- departmental bulletin paper
-
- データソース種別
-
- IRDB
- NDLサーチ
- CiNii Articles