Dimitri Bertsekas: "Distributed and Multiagent Reinforcement Learning"
Video Statistics and Information
Views: 4,628
Rating: 4.9615383 out of 5
Keywords: ipam, math, mathematics, ucla, Dimitri Bertsekas, Massachusetts Institute of Technology, Arizona State University, reinforcement learning, asynchronous computation, programming
Id: nTPuL6iVuwU
Channel Id: undefined
Length: 57min 51sec (3471 seconds)
Published: Thu Apr 09 2020
Please note that this website is currently a work in progress! Lots of interesting data and statistics to come.
"Distributed and Multiagent Reinforcement Learning" Dimitri Bertsekas - Massachusetts Institute of Technology & Arizona State University
Abstract: We discuss issues of parallelization and distributed asynchronous computation for large scale dynamic programming problems. We first focus on asynchronous policy iteration with multiprocessor systems using state-partitioned architectures. Exact convergence results are given for the case of lookup table representations, and error bounds are given for their compact representation counterparts. A computational study is presented with POMDP problems with more than 10^15 states. In a related context, we introduce multiagent on-line schemes, whereby at each stage, each agent's decision is made by executing a local rollout algorithm that uses a base policy, together with some coordinating information from the other agents. The amount of local computation required at every stage by each agent is independent of the number of agents, while the amount of global computation (over all agents) grows linearly with the number of agents. By contrast, with the standard rollout algorithm, the amount of global computation grows exponentially with the number of agents. Despite the drastic reduction in required computation, we show that our algorithm has the fundamental cost improvement property of rollout: an improved performance relative to the base policy.
Institute for Pure and Applied Mathematics, UCLA February 24, 2020