Least Squares Temporal Difference Methods: An Analysis under General Conditions

We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) with the least squares temporal difference (LSTD) algorithm, LSTD($\lambda$), in an exploration-enhanced learning context, where policy costs are computed from observations of a Markov chain different from the one corresponding to the policy under evaluation.
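As a point of reference for the algorithm named in the abstract, the following is a minimal sketch of the standard (on-policy) LSTD($\lambda$) estimator with linear features. It is illustrative only: the function name, the feature map `phi`, the discount factor `gamma`, and the trajectory format are assumptions, and it does not implement the exploration-enhanced (off-policy) variant analyzed in the paper.

```python
import numpy as np

def lstd_lambda(trajectory, phi, gamma=0.95, lam=0.9):
    """Standard LSTD(lambda) sketch.

    trajectory: list of (state, cost, next_state) transitions
    phi: feature map, state -> 1-D numpy array
    Returns theta such that phi(s) @ theta approximates the policy cost.
    """
    k = phi(trajectory[0][0]).shape[0]       # feature dimension
    A = np.zeros((k, k))
    b = np.zeros(k)
    z = np.zeros(k)                          # eligibility trace
    for s, cost, s_next in trajectory:
        z = gamma * lam * z + phi(s)         # decay and update the trace
        A += np.outer(z, phi(s) - gamma * phi(s_next))
        b += z * cost
    # Solve A theta = b; least squares guards against a singular A
    theta, *_ = np.linalg.lstsq(A, b, rcond=None)
    return theta
```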


Bibliographic Details
Main Author: Yu, Huizhen
Other Authors: Massachusetts Institute of Technology. Laboratory for Information and Decision Systems
Format: Article
Language: English
Published: Society for Industrial and Applied Mathematics, 2013
Online Access: http://hdl.handle.net/1721.1/77629

Similar Items