"Neural Replicator Dynamics: Multiagent Learning via Hedging Policy Gradients."

Daniel Hennes et al. (2020)
a service of Schloss Dagstuhl - Leibniz Center for Informatics