"Regret Bounds for Reinforcement Learning via Markov Chain Concentration."

Ronald Ortner (2020)

Details and statistics

DOI: 10.1613/JAIR.1.11316

access: open

type: Journal Article

metadata version: 2021-10-14
