"Counterfactual Risk Minimization: Learning from Logged Bandit Feedback."

Adith Swaminathan, Thorsten Joachims (2015)
a service of  Schloss Dagstuhl - Leibniz Center for Informatics