"Average Reward Optimization with Multiple Discounting Reinforcement Learners."

Chris Reinke, Eiji Uchibe, Kenji Doya (2017)

Details and statistics

DOI: 10.1007/978-3-319-70087-8_81

access: closed

type: Conference or Workshop Paper

metadata version: 2017-11-15

a service of  Schloss Dagstuhl - Leibniz Center for Informatics