Q-learning and policy iteration algorithms for stochastic shortest path problems

Q-learning and policy iteration algorithms for stochastic shortest path problems

We consider the stochastic shortest path problem, a classical finite-state Markovian decision problem with a termination state, and we propose new convergent Q-learning algorithms that combine elements of policy iteration and classical Q-learning/value iteration. These algorithms are related to the...

Full description

Bibliographic Details
Main Authors:	Yu, Huizhen, Bertsekas, Dimitri P.
Other Authors:	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Format:	Article
Language:	en_US
Published:	Springer-Verlag 2015
Online Access:	http://hdl.handle.net/1721.1/93745 https://orcid.org/0000-0001-6909-7208

Similar Items

On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems
by: Yu, Huizhen, et al.
Published: (2015)

Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming
by: Bertsekas, Dimitri P, et al.
Published: (2019)

Distributed Asynchronous Policy Iteration in Dynamic Programming
by: Bertsekas, Dimitri P., et al.
Published: (2011)

An analysis of stochastic shortest path problems
Published: (2003)

Stochastic shortest path problems with recourse
Published: (2003)

Stochastic and shortest path games : theory and algorithms
by: Patek, Stephen D. (Stephen David)
Published: (2005)

Stochastic shortest path games : theory and algorithms
by: Patek, Stephen D. (Stephen David)
Published: (2005)

Faster algorithms for the shortest path problem
by: Ahuja, Ravindra K.
Published: (2009)

Approximate policy iteration: A survey and some new methods
by: Bertsekas, Dimitri P.
Published: (2012)

Algorithms for the shortest path problem with time windows and shortest path reoptimization in time-dependent networks
by: Glenn, Andrew M., 1978-
Published: (2014)

Stabilization of Stochastic Iterative Methods for Singular and Nearly Singular Linear Systems
by: Wang, Mengdi, et al.
Published: (2015)

An Anytime Algorithm for Chance Constrained Stochastic Shortest Path Problems and Its Application to Aircraft Routing
by: Hong, Sungkweon, et al.
Published: (2022)

Maximizing the probability of arriving on time : a stochastic shortest path problem
by: Cao, Zhiguang
Published: (2017)

Efficient algorithms for continuous-space shortest path problems
Published: (2003)

An auction algorithm for shortest paths
Published: (2003)

Novel approach to solving stochastic constrained shortest path problem with quantum computing
by: Yang, Richard Chen Xiao
Published: (2023)

Risk-bounded Programming using Constrained, Hierarchical, Stochastic Shortest Path Problems
by: Hong, Sungkweon
Published: (2023)

Ambulance shortest path problem by using link-based algorithm
by: Lee, Chooi Hua
Published: (2015)

The origin-destination shortest path problem
Published: (2003)

Stochastic and dynamic shortest distance problems
by: Polychronopoulos, George H. (George Harry)
Published: (2005)

Shortest Path Algorithms: A Comparison
by: Golden, Bruce L., 1950-
Published: (2004)

An experiment on the performance of shortest path algorithm
by: Chan, Simon Yew Meng, et al.
Published: (2016)

Modified auction algorithms for shortest paths
Published: (2003)

Parallel shortest path auction algorithms
Published: (2003)

Polynomial auction algorithms for shortest paths
Published: (2003)

Basis Function Adaptation Methods for Cost Approximation in MDP
by: Yu, Huizhen, et al.
Published: (2010)

Convergence Results for Some Temporal Difference Methods Based on Least Squares
by: Yu, Huizhen, et al.
Published: (2012)

A Unifying Polyhedral Approximation Framework for Convex Optimization
by: Bertsekas, Dimitri P., et al.
Published: (2011)

Dynamic shortest path algorithms for IVHS applications
by: Farkas, András, 1965-
Published: (2005)

Continuous-time dynamics shortest path algorithms
by: Dean, Brian C. (Brian Christopher), 1975-
Published: (2013)

Communication complexity of distributed shortest path algorithms
by: Friedman, Daniel Uri
Published: (2005)

Communication complexity of distributed shortest path algorithms
Published: (2003)

An approximate shortest path algorithm for hierarchical networks
by: Bhagavatula, Krishna K. (Krishna Kishore)
Published: (2005)

An adaptive distributed Dijkstra shortest path algorithm
Published: (2003)

An auction/sequential shortest path algorithm for the minimum cost network flow problem
Published: (2003)

Sensitivity Analysis for Shortest Path Problems and Maximum Capacity Path Problems in Undirected Graphs
by: Ramaswamy, Ramkumar, et al.
Published: (2004)

Sensitivity Analysis for Shortest Path Problems and Maximum Capacity Path Problems in Undirected Graphs
by: Ramaswamy, Ramkumar, et al.
Published: (2004)

Proximal algorithms and temporal difference methods for solving fixed point problems
by: Bertsekas, Dimitri P
Published: (2021)

Parameter Shortest Path Algorithms with an Application to Cyclic Staffing
by: Karp, Richard M., et al.
Published: (2004)

Parallel implementations of dynamic traffic assignment models and algorithms for dynamic shortest path problems
by: Jiang, Hai, 1979-
Published: (2006)