A tutorial on the art of dynamic programming for some issues concerning Bellman’s principle of optimality

Reinforcement learning (RL) is fundamental to current artificial intelligence (AI). Since dynamic programming, which is based on Richard Bellman’s principle of optimality, is the basis of RL (and other AI disciplines such as A* search), it is important to apply that principle correctly and artfully....

Bibliographic Details
Main Authors: Eiji Mizutani, Stuart Dreyfus
Format: Article
Language: English
Published: Elsevier 2023-12-01
Series: ICT Express
Subjects:
Online Access: http://www.sciencedirect.com/science/article/pii/S2405959523000802