"Safe Policy Search for Lifelong Reinforcement Learning with Sublinear Regret."

Haitham Bou-Ammar, Rasul Tutunov, Eric Eaton (2015)