Approximate policy iteration: A survey and some new methods

We consider the classical policy iteration method of dynamic programming (DP), where approximations and simulation are used to deal with the curse of dimensionality. We survey a number of issues: convergence and rate of convergence of approximate policy evaluation methods, singularity and susceptibi...

Bibliographic Details
Main Author:	Bertsekas, Dimitri P.
Other Authors:	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Format:	Article
Language:	en_US
Published:	Springer-Verlag 2012
Online Access:	http://hdl.handle.net/1721.1/73485 https://orcid.org/0000-0001-6909-7208

Similar Items