A Unifying Framework for Reinforcement Learning and Planning

Sequential decision making, commonly formalized as optimization of a Markov Decision Process, is a key challenge in artificial intelligence. Two successful approaches to MDP optimization are reinforcement learning and planning, which both largely have their own research communities. However, if both...

Full description

Bibliographic Details
Main Authors: Thomas M. Moerland, Joost Broekens, Aske Plaat, Catholijn M. Jonker
Format: Article
Language:English
Published: Frontiers Media S.A. 2022-07-01
Series:Frontiers in Artificial Intelligence
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/frai.2022.908353/full