Distributed Asynchronous Policy Iteration in Dynamic Programming

We consider the distributed solution of dynamic programming (DP) problems by policy iteration. We envision a network of processors, each updating asynchronously a local policy and a local cost function, defined on a portion of the state space. The computed values are communicated asynchronously...

Full description

Bibliographic Details
Main Authors: Bertsekas, Dimitri P., Yu, Huizhen
Other Authors: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Format: Article
Language:en_US
Published: University of Illinois at Urbana-Champaign 2011
Online Access:http://hdl.handle.net/1721.1/63169
https://orcid.org/0000-0001-6909-7208