"Fixed Points of Approximate Value Iteration and Temporal-Difference Learning."

Daniela Pucci de Farias, Benjamin Van Roy (2000)

Details and statistics

DOI:

access: unavailable

type: Conference or Workshop Paper

metadata version: 2002-11-26

a service of  Schloss Dagstuhl - Leibniz Center for Informatics