Convergence Results for Some Temporal Difference Methods Based on Least Squares

Convergence Results for Some Temporal Difference Methods Based on Least Squares

We consider finite-state Markov decision processes, and prove convergence and rate of convergence results for certain least squares policy evaluation algorithms of the type known as LSPE(lambda ). These are temporal difference methods for constructing a linear function approximation of the cost func...

Full description

Bibliographic Details
Main Authors:	Yu, Huizhen, Bertsekas, Dimitri P.
Other Authors:	Massachusetts Institute of Technology. Laboratory for Information and Decision Systems
Format:	Article
Language:	en_US
Published:	Institute of Electrical and Electronics Engineers 2012
Online Access:	http://hdl.handle.net/1721.1/74102 https://orcid.org/0000-0001-6909-7208

Similar Items

Least Squares Temporal Difference Methods: An Analysis under General Conditions
by: Yu, Huizhen
Published: (2013)

A unified framework for temporal difference methods
by: Bertsekas, Dimitri P.
Published: (2010)

Pathologies of Temporal Difference Methods in Approximate Dynamic Programming
by: Bertsekas, Dimitri P.
Published: (2011)

Proximal algorithms and temporal difference methods for solving fixed point problems
by: Bertsekas, Dimitri P
Published: (2021)

Basis Function Adaptation Methods for Cost Approximation in MDP
by: Yu, Huizhen, et al.
Published: (2010)

Some new asymptotic theory for least squares series: Pointwise and uniform results
by: Belloni, Alexandre, et al.
Published: (2018)

Convergence of the Least Squares Shadowing Method for Computing Derivative of Ergodic Averages
by: Wang, Qiqi
Published: (2014)

Approximate policy iteration: A survey and some new methods
by: Bertsekas, Dimitri P.
Published: (2012)

Q-learning and policy iteration algorithms for stochastic shortest path problems
by: Yu, Huizhen, et al.
Published: (2015)

On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems
by: Yu, Huizhen, et al.
Published: (2015)

A Unifying Polyhedral Approximation Framework for Convex Optimization
by: Bertsekas, Dimitri P., et al.
Published: (2011)

Distributed Asynchronous Policy Iteration in Dynamic Programming
by: Bertsekas, Dimitri P., et al.
Published: (2011)

Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming
by: Bertsekas, Dimitri P, et al.
Published: (2019)

A least squares convergence criterion for nonequilibrium boundary layer solutions.
by: Elgin, James Brinson.
Published: (2023)

Available Transfer Capability and Least Square Method
by: Hojabri, Mojgan, et al.
Published: (2012)

Available transfer capability and least square method
by: Hojabri, Mojgan, et al.
Published: (2012)

Convex Total Least Squares
by: Slavov, Nikolai G, et al.
Published: (2015)

Valuing American style derivatives by least squares methods
by: Cerrato, Mario
Published: (2007)

Partial Least Squares:Another Method Of Structural Equation
by: Perpustakaan UGM, i-lib
Published: (2004)

Partial Least Squares: Another Method Of Structural Equation
by: Perpustakaan UGM, i-lib
Published: (2005)

Incremental least squares methods and the extended Kalman filter
Published: (2003)

Convergence and Stability of Iteratively Re-weighted Least Squares Algorithms for Sparse Signal Recovery in the Presence of Noise
by: Babadi, Behtash, et al.
Published: (2014)

Notes on Regularized Least Squares
by: Rifkin, Ryan M., et al.
Published: (2007)

The calculation of ordinary least squares,
by: Hall, Robert Ernest
Published: (2011)

Registration-Based Guided Least-Squares Waveform Inversion
by: Calandra, Henri, et al.
Published: (2017)

Least Squares Shadowing Method for Sensitivity Analysis of Differential Equations
by: Chater, Mario, et al.
Published: (2018)

A hybrid incremental gradient method for least squares problems
Published: (2003)

Modified Chi Square Test of Goodness of Fit (MCSTGF) Based on Least Square Method: An Application to Water Transportation
by: Okwonu, Friday Zinzendoff, et al.
Published: (2021)

Asymptotics of Gaussian Regularized Least-Squares
by: Lippert, Ross, et al.
Published: (2005)

MODEL REGRESI LINEAR PARTIAL LEAST SQUARE TERGENERALISASI PARTIAL LEAST SQUARE GENERALISED LINEAR REGRESSION
by: , ANI APRIANI, et al.
Published: (2012)

An asymmetric least squares test of heteroscedasticity
by: Newey, Whitney K., et al.
Published: (2011)

Incremental proximal methods for large scale convex optimization
by: Bertsekas, Dimitri P.
Published: (2012)

Massively Parallel Solver for the High-Order Galerkin Least-Squares Method
by: Yano, Masayuki
Published: (2010)

ANALISIS PARTIAL LEAST SQUARES REGRESI (PLS-R) PARTIAL LEAST SQUARES REGRESSION (PLS-R) ANALYSIS
by: , INGGRIT RABERTA, et al.
Published: (2013)

Derivative estimation of triangular patch by using cubic least square method
by: Awang, Noorehan, et al.
Published: (2016)

Improvement of least-squares integration method with iterative compensations in fringe reflectometry
by: Huang, Lei, et al.
Published: (2013)

A new class of incremental gradient methods for least squares problems
Published: (2003)

Massively parallel solver for the high-order Galerkin Least-Squares method
by: Yano, Masayuki, Ph. D. Massachusetts Institute of Technology
Published: (2010)

Fast Rates for Regularized Least-squares Algorithm
by: Caponnetto, Andrea, et al.
Published: (2005)

Registration-guided least-squares waveform inversion
by: Baek, Hyoungsu, et al.
Published: (2014)