On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems
We consider a totally asynchronous stochastic approximation algorithm, Q-learning, for solving finite-space stochastic shortest path (SSP) problems, which are undiscounted, total-cost Markov decision processes with an absorbing and cost-free state. For the most commonly used SSP models, existing con...
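To make the setting concrete, here is a minimal sketch of asynchronous Q-learning on a toy SSP. The 3-state model (two transient states, one absorbing cost-free state), its transition probabilities, and its costs are hypothetical illustrations, not taken from the paper; the update rule is the standard Q-learning iteration with the absorbing state's value pinned at zero.

```python
import random

# Hypothetical toy SSP (not from the paper): states 0 and 1 are transient,
# state 2 is absorbing and cost-free.
ABSORBING = 2
ACTIONS = (0, 1)

def step(state, action, rng):
    """Sample (cost, next_state) for the toy model."""
    if action == 0:
        # Cost 1; reach the absorbing state with probability 0.5, else stay put.
        return 1.0, ABSORBING if rng.random() < 0.5 else state
    # Cost 2; reach the absorbing state with probability 0.9, else jump to the
    # other transient state.
    return 2.0, ABSORBING if rng.random() < 0.9 else 1 - state

def q_learning(num_updates=20000, seed=0):
    rng = random.Random(seed)
    # Q-values over transient state-action pairs; the absorbing state
    # contributes zero future cost by construction.
    Q = {(s, a): 0.0 for s in (0, 1) for a in ACTIONS}
    visits = dict.fromkeys(Q, 0)
    pairs = list(Q)
    for _ in range(num_updates):
        # Asynchronous flavor: each iteration updates one randomly chosen
        # state-action pair rather than sweeping all of them.
        s, a = rng.choice(pairs)
        cost, s2 = step(s, a, rng)
        future = 0.0 if s2 == ABSORBING else min(Q[(s2, b)] for b in ACTIONS)
        visits[(s, a)] += 1
        alpha = 1.0 / visits[(s, a)]  # diminishing stepsize
        Q[(s, a)] += alpha * (cost + future - Q[(s, a)])
    return Q
```

For this toy model the Bellman equation has the fixed point Q(s, 0) = 2 and Q(s, 1) = 2.2 for both transient states, so action 0 is optimal everywhere; under the diminishing stepsize the iterates approach these values.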
Main Authors: Yu, Huizhen; Bertsekas, Dimitri P.
Other Authors: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Format: Article
Language: English
Published: Institute for Operations Research and the Management Sciences (INFORMS), 2015
Online Access: http://hdl.handle.net/1721.1/93744 https://orcid.org/0000-0001-6909-7208
Similar Items
- Q-learning and policy iteration algorithms for stochastic shortest path problems, by Yu, Huizhen, et al. (2015)
- Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming, by Bertsekas, Dimitri P., et al. (2019)
- Stochastic shortest path problems with recourse (2003)
- An analysis of stochastic shortest path problems (2003)
- Distributed Asynchronous Policy Iteration in Dynamic Programming, by Bertsekas, Dimitri P., et al. (2011)