Stochastic approximation for non-expansive maps : application to Q-learning algorithms
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 1997.
Prif Awdur: | Abounadi, Jinane, 1966- |
---|---|
Awduron Eraill: | Dimitri P. Bersekas. |
Fformat: | Traethawd Ymchwil |
Iaith: | eng |
Cyhoeddwyd: |
Massachusetts Institute of Technology
2005
|
Pynciau: | |
Mynediad Ar-lein: | http://hdl.handle.net/1721.1/10033 |
Eitemau Tebyg
-
Asynchronous stochastic approximation and Q-learning
Cyhoeddwyd: (2003) -
Approximation algorithms for stochastic scheduling problems
gan: Dean, Brian C. (Brian Christopher), 1975-
Cyhoeddwyd: (2006) -
Approximation algorithms for stochastic scheduling on unrelated machines
gan: Scott, Jacob (Jacob Healy)
Cyhoeddwyd: (2009) -
Realization and approximation of stationary stochastic processes
gan: Avniel, Yehuda
Cyhoeddwyd: (2005) -
Q-learning and policy iteration algorithms for stochastic shortest path problems
gan: Yu, Huizhen, et al.
Cyhoeddwyd: (2015)