On the Convergence of Stochastic Iterative Dynamic Programming Algorithms

On the Convergence of Stochastic Iterative Dynamic Programming Algorithms

Recent developments in the area of reinforcement learning have yielded a number of new algorithms for the prediction and control of Markovian environments. These algorithms, including the TD(lambda) algorithm of Sutton (1988) and the Q-learning algorithm of Watkins (1989), can be motivated he...

书目详细资料
Main Authors:	Jaakkola, Tommi, Jordan, Michael I., Singh, Satinder P.
语言:	en_US
出版:	2004
主题:	reinforcement learning stochastic approximation sconvergence dynamic programming
在线阅读:	http://hdl.handle.net/1721.1/7205

相似书籍

A non-iterative distributed approximate dynamic programming algorithm for frequency security-constrained stochastic economic dispatch
由: Xiangyong Feng, et al.
出版: (2025-05-01)

Stochastic approximation and its applications /
由: 575513 Chen, Hanfu
出版: (2002)

Approximation and weak convergence methods for random processes, with applications to stochastic systems theory /
由: 272176 Kushner, Harold J. (Harold Joseph), 1933-
出版: (1984)

Stochastic dynamic programming and the control of queueing systems /
由: Sennott, Linn I., 1943-
出版: (1999)

Generalized bounds for convex multistage stochastic programs /
由: Kuhn, Daniel, 1975-
出版: (2005)

Dynamic and stochastic control
由: 196299 Bertsekas, Dimitri P.
出版: (1976)

Dynamic shortest path in stochastic dynamic networks : ship routing problem /
由: 261086 Teoh, Gaik Hoon
出版: (2004)

Stochastic approximation and recursive algorithms and applications /
由: 272176 Kushner, Harold J., et al.
出版: (2003)

Dynamic shortest path in stochastic dynamic networks : ship routing problem [compact disc] /
由: 261086 Teoh, Gaik Hoon
出版: (2004)

A simple method for measuring the performance of stochastic algorithms /
由: 465581 Nasaruddin Zenon, et al.
出版: (2003)

Stochastic approximation /
由: 405479 Wasan, M.
出版: (1969)

6.231 Dynamic Programming and Stochastic Control, Fall 2011
由: Bertsekas, Dimitri
出版: (2011)

Stochastic programming
出版: (1980)

Stochastic programming /
由: Kolbin, V. V. (Viacheslav Viktorovich), 1941-
出版: (1977)

Stochastic programming
出版: (1986)

Stochastic programming 84 /
由: Prekopa, A.
出版: (1986)

6.231 Dynamic Programming and Stochastic Control, Fall 2008
由: Bertsekas, Dimitri
出版: (2008)

Introduction to stochastic programming /
由: Birge, John R., et al.
出版: (1997)

Stochastic linear programming /
由: Kall, Peter, et al.
出版: (1980)

6.231 Dynamic Programming and Stochastic Control, Fall 2002
由: Bertsekas, Dimitri P.
出版: (2002)

Stochastic convergence /
由: 389129 Lukacs, Eugene
出版: (1975)

Stochastic programming with multiple objective functions /
由: 261569 Stancu-Minasian, I. M., et al.
出版: (1984)

Discrete stochastic programming /
由: 344357 Cocks, K. D.

Stochastic algorithms : foundations and applications : third international symposium, SAGA 2005, Moscow, Russia, October 20-22, 2005 : proceedings /
由: SAGA 2005 (2005 : Moscow, Russia), et al.
出版: (2005)

Stochastic problems in dynamics/
由: Clarkson, B. L. (Brian Leonard)
出版: (1977)

Stochastic approximation methods for constrained and unconstrained system /
由: 272176 Kushner, Harold J. (Harold Joseph), 1933-, et al.
出版: (1978)

Simultaneous perturbation stochastic aproximation for Lipshitz functions
由: Vaida Bartkutė, et al.
出版: (2004-12-01)

Stochastic Algorithms: Foundations and Applications [electronic resource] : 4th International Symposium, SAGA 2007, Zurich, Switzerland, September 13-14, 2007: Proceedings /
由: SAGA 2007 (2007 : Zurich, Switzerland), et al.
出版: (2007)

Stochastic differential systems /
由: Christopeif, N.
出版: (1986)

Stochastic linear programming/
由: 222973 Kall, Peter
出版: (1976)

Stochastic dynamics of structures /
由: Li, Jie, 1957 Oct.-, et al.
出版: (2009)

Comparison of defuzzification methods for fuzzy stochastic linear programming /
由: Gan, Siew Ling, 1984- author, et al.
出版: (2014)

L2-convergence of Yosida approximation for semi-linear backward stochastic differential equation with jumps in infinite dimension
由: Hani Abidi, et al.
出版: (2025-01-01)

State of the Art of Adaptive Dynamic Programming and Reinforcement Learning
由: Derong Liu, et al.
出版: (2022-12-01)

On approximation of stochastic integrals with respect to a fractional Brownian motion
由: Kęstutis Kubilius
出版: (2005-12-01)

Stochastic Combinatorial Optimization with Risk
由: Nikolova, Evdokia
出版: (2008)

Comparison of defuzzification methods for fuzzy stochastic linear programming [electronic resource] /
由: Gan, Siew Ling, 1984- author, et al.
出版: (2014)

Water resource management at Tabarkabad dam in Quchan city: using orthogonal polynomials to solve stochastic dynamic programming problems
由: Saeid Azimifard, et al.
出版: (2017-06-01)

Approximations in Mean Square Analysis of Stochastically Forced Equilibria for Nonlinear Dynamical Systems
由: Irina Bashkirtseva
出版: (2024-07-01)

Stochastic linear programming : models, theory, and computation /
由: 222973 Kall, Peter, et al.
出版: (2005)