Exploiting multiple abstractions in episodic RL via reward shaping

One major limitation to the applicability of Reinforcement Learning (RL) to many practical domains is the large number of samples required to learn an optimal policy. To address this problem and improve learning efficiency, we consider a linear hierarchy of abstraction layers of the Markov Decision...

Bibliographic details
Main authors: Cipollone, R; De Giacomo, G; Favorito, M; Iocchi, L; Patrizi, F
Format: Conference item
Language: English
Published: Association for the Advancement of Artificial Intelligence, 2023