Exploiting multiple abstractions in episodic RL via reward shaping

One major limitation to the applicability of Reinforcement Learning (RL) to many practical domains is the large number of samples required to learn an optimal policy. To address this problem and improve learning efficiency, we consider a linear hierarchy of abstraction layers of the Markov Decision...

Full description

Bibliographic Details
Main Authors: Cipollone, R, De Giacomo, G, Favorito, M, Iocchi, L, Patrizi, F
Format: Conference item
Language:English
Published: Association for the Advancement of Artificial Intelligence 2023