"Model-based Reinforcement Learning from Signal Temporal Logic Specifications."

Parv Kapoor, Anand Balakrishnan, Jyotirmoy V. Deshmukh (2020)
a service of  Schloss Dagstuhl - Leibniz Center for Informatics