"Distributed consensus-based multi-agent temporal-difference learning."

Milos S. Stankovic, Marko Beko, Srdjan S. Stankovic (2023)

Details and statistics

DOI: 10.1016/J.AUTOMATICA.2023.110922

access: open

type: Journal Article

metadata version: 2023-06-26

a service of  Schloss Dagstuhl - Leibniz Center for Informatics