Newton’s method for reinforcement learning and model predictive control
The purpose of this paper is to propose and develop a new conceptual framework for approximate Dynamic Programming (DP) and Reinforcement Learning (RL). This framework centers around two algorithms, which are designed largely independently of each other and operate in synergy through the powerful me...
Main Author: | Dimitri Bertsekas |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2022-06-01
|
Series: | Results in Control and Optimization |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2666720722000157 |
Similar Items
-
Efficiently Mastering the Game of NoGo with Deep Reinforcement Learning Supported by Domain Knowledge
by: Yifan Gao, et al.
Published: (2021-06-01) -
Inteligencia Artificial y Juegos de Tablero: Desde el Turco hasta AlphaZero
by: Ivan Francisco Valencia, et al.
Published: (2023-05-01) -
Chess AI: Competing Paradigms for Machine Intelligence
by: Shiva Maharaj, et al.
Published: (2022-04-01) -
Interactive Robot for Playing Russian Checkers
by: Ekaterina E. Kopets, et al.
Published: (2020-12-01) -
On the Value of Chess Squares
by: Aditya Gupta, et al.
Published: (2023-09-01)