Distributed randomized multiagent policy iteration in reinforcement learning

We propose a distributed randomized policy iteration algorithm for infinite horizon dynamic programming problems for which the control at each stage is m-dimensional. The traditional policy iteration algorithm involves performing a minimization over an m-dimensional constraint set and has a computat...

Full description

Bibliographic Details
Main Author: Weipeng Zhang
Format: Article
Language:English
Published: Elsevier 2023-09-01
Series:Results in Control and Optimization
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2666720723000577