"Finite-Time Regret Bounds for the Multiarmed Bandit Problem."

Nicolò Cesa-Bianchi, Paul Fischer (1998)
a service of Schloss Dagstuhl - Leibniz Center for Informatics