Approximate policy iteration: A survey and some new methods

We consider the classical policy iteration method of dynamic programming (DP), where approximations and simulation are used to deal with the curse of dimensionality. We survey a number of issues: convergence and rate of convergence of approximate policy evaluation methods, singularity and susceptibi...

Full description

Bibliographic Details
Main Author: Bertsekas, Dimitri P.
Other Authors: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Format: Article
Language:en_US
Published: Springer-Verlag 2012
Online Access:http://hdl.handle.net/1721.1/73485
https://orcid.org/0000-0001-6909-7208