Newton’s method for reinforcement learning and model predictive control

The purpose of this paper is to propose and develop a new conceptual framework for approximate Dynamic Programming (DP) and Reinforcement Learning (RL). This framework centers around two algorithms, which are designed largely independently of each other and operate in synergy through the powerful me...

Full description

Bibliographic Details
Main Author:	Dimitri Bertsekas
Format:	Article
Language:	English
Published:	Elsevier 2022-06-01
Series:	Results in Control and Optimization
Subjects:	AlphaZero Off-line training On-line play Dynamic programming over an infinite horizon Reinforcement learning Model predictive control
Online Access:	http://www.sciencedirect.com/science/article/pii/S2666720722000157

Internet

http://www.sciencedirect.com/science/article/pii/S2666720722000157

Newton’s method for reinforcement learning and model predictive control

Internet

Similar Items