Newton’s method for reinforcement learning and model predictive control

The purpose of this paper is to propose and develop a new conceptual framework for approximate Dynamic Programming (DP) and Reinforcement Learning (RL). This framework centers around two algorithms, which are designed largely independently of each other and operate in synergy through the powerful me...

Full description

Bibliographic Details
Main Author: Dimitri Bertsekas
Format: Article
Language:English
Published: Elsevier 2022-06-01
Series:Results in Control and Optimization
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2666720722000157