"Softmax Deep Double Deterministic Policy Gradients."

Ling Pan, Qingpeng Cai, Longbo Huang (2020)