"Learning in Structured MDPs with Convex Cost Functions: Improved Regret ..."

Shipra Agrawal, Randy Jia (2019)

Details and statistics

DOI: 10.1145/3328526.3329565

access: closed

type: Conference or Workshop Paper

metadata version: 2019-06-26

a service of  Schloss Dagstuhl - Leibniz Center for Informatics