"Analysis and improvement of policy gradient estimation."

Tingting Zhao et al. (2012)
a service of Schloss Dagstuhl - Leibniz Center for Informatics