"TD(0) Converges Provably Faster than the Residual Gradient Algorithm."

Ralf Schoknecht, Artur Merke (2003)