A Span Seminorm Approach to Controlled Markov Set-Chains

Hosaka, Masanori, Kurano, Masami

Bibliographic information

Alternative title

Span Seminorm Approach to Controlled Markov Set-Chains

Description

type:text

In a controlled Markov set-chain with finite state and action spaces, we find a policy, called average-optimal, that maximizes the Cesàro sums of the per-period rewards over all stationary policies with respect to a certain partial order. Under uniformly scrambling conditions, the dynamic programming operator for our model is proved to be a contraction in the span seminorm. By analysing the behavior of the expected total rewards over the T-horizon as T approaches ∞, using a fixed point of the span-contraction operator, we give a constructive proof of the existence of an average-optimal policy.
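As a rough illustration of the span-seminorm idea in the abstract, the following sketch treats the simpler case of an ordinary finite Markov decision process, with a single transition matrix per action rather than the interval-valued set-chain of the paper. The function names (`span`, `bellman`, `relative_value_iteration`) and the data layout (`rewards[s][a]`, `transitions[s][a]` as a probability row) are assumptions made for this example, not the authors' notation. It relies on the classical fact that, when every pair of transition rows shares positive mass on some state (a scrambling-type condition), the undiscounted dynamic programming operator contracts in the span seminorm, so successive differences flatten toward a constant vector whose value approximates the optimal average reward.

```python
def span(v):
    """Span seminorm sp(v) = max(v) - min(v); zero exactly on constant vectors."""
    return max(v) - min(v)


def bellman(v, rewards, transitions):
    """Undiscounted DP operator: (Tv)(s) = max_a [ r(s,a) + sum_j p(j|s,a) v(j) ]."""
    return [
        max(
            r + sum(p * vj for p, vj in zip(row, v))
            for r, row in zip(rewards[s], transitions[s])
        )
        for s in range(len(v))
    ]


def relative_value_iteration(rewards, transitions, tol=1e-10, max_iter=10_000):
    """Iterate T until sp(Tv - v) < tol; return (approximate gain, relative values).

    Assumes T is a span contraction (e.g. under a scrambling condition);
    otherwise the loop may hit max_iter and raise.
    """
    v = [0.0] * len(rewards)
    for _ in range(max_iter):
        tv = bellman(v, rewards, transitions)
        diff = [a - b for a, b in zip(tv, v)]
        if span(diff) < tol:
            # The optimal gain lies between min(diff) and max(diff).
            gain = 0.5 * (max(diff) + min(diff))
            return gain, [x - tv[0] for x in tv]
        # Subtract a reference value so the iterates stay bounded.
        v = [x - tv[0] for x in tv]
    raise RuntimeError("no convergence within max_iter iterations")
```

Stopping on `span(diff)` rather than on a norm of `diff` is the point of the design: in the average-reward setting the iterates themselves grow linearly with the horizon, and only their shape, up to an additive constant, converges.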

source:Bulletin of the Faculty of Education, Chiba University. III, Natural sciences

Published in

Bulletin of the Faculty of Education, Chiba University. III, Natural sciences (千葉大学教育学部研究紀要. III, 自然科学編) 46 13-23, 1998-02-28

Faculty of Education, Chiba University

Details

CRID: 1050007072213340672

NII Article ID: 110004624632

NII Bibliographic ID: AN10494753

ISSN: 13427423

NDL Bibliographic ID: 4654678

Web Site: https://opac.ll.chiba-u.jp/da/curator/900024419/; http://id.ndl.go.jp/bib/4654678; https://ndlsearch.ndl.go.jp/books/R000000004-I4654678

Text language code: en

Resource type: departmental bulletin paper

Data source type

IRDB
NDL Search
CiNii Articles