Linear convergence of a policy gradient method for some finite horizon continuous time control problems

Despite its popularity in the reinforcement learning community, a provably convergent policy gradient method for continuous space-time control problems with nonlinear state dynamics has been elusive. This paper proposes proximal gradient algorithms for feedback controls of finite-time horizon stocha...

Full description

Bibliographic Details
Main Authors: Reisinger, C, Stockinger, W, Zhang, Y
Format: Journal article
Language:English
Published: Society for Industrial and Applied Mathematics 2023