"Win or Learn Fast Proximal Policy Optimisation."

Dino Stephen Ratcliffe, Katja Hofmann, Sam Devlin (2019)