"Average Reward Optimization Objective In Partially Observable Domains."

Yuri Grinberg, Doina Precup (2013)
a service of  Schloss Dagstuhl - Leibniz Center for Informatics