著者名,書名,版表示,出版者名,出版年,シリーズ名,番号,ISBN,ISSN,URL "Lapan, Maxim","Deep reinforcement learning hands-on : apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more",,Packt,2018,Expert insight,,9781788834247,,https://cir.nii.ac.jp/crid/1130000796572617728