著者名,論文名,雑誌名,ISSN,出版者名,出版日付,巻,号,ページ,URL,URL(DOI) MAHADEVAN S.,Self-improving factory simulation using continuous-time average-reward reinforcement learning,"Proceedings of the 14th International Conference on Machine Learning, 1997",,Morgan Kaufmann,1997,,,,https://cir.nii.ac.jp/crid/1571698600213893504,