"Provable Benefit of Orthogonal Initialization in Optimizing Deep Linear ..."

Wei Hu, Lechao Xiao, Jeffrey Pennington (2020)