Author,Title,Edition,Publisher,Publication Year,Series Title,Number,ISBN,ISSN,URL
"Wal, J. van der",Stochastic dynamic programming : successive approximations and nearly optimal strategies for Markov decision processes and Markov games,2nd ed.,Mathematisch Centrum,1984,Mathematical Centre tracts,,9061962188,,https://cir.nii.ac.jp/crid/1130282270872001280