"Q-learning and enhanced policy iteration in discounted dynamic programming."

Dimitri P. Bertsekas, Huizhen Yu (2010)

DOI: 10.1109/CDC.2010.5717930

access: closed

type: Conference or Workshop Paper

metadata version: 2017-05-19

a service of Schloss Dagstuhl - Leibniz Center for Informatics