Rewards Prediction-Based Credit Assignment for Reinforcement Learning With Sparse Binary Rewards

In reinforcement learning (RL), a reinforcement signal may be infrequent and delayed, not appearing immediately after the action that triggered the reward. To trace back what sequence of actions contributes to delayed rewards, e.g., credit assignment (CA), is one of the biggest challenges in RL. Thi...

Full description

Bibliographic Details
Main Authors: Minah Seo, Luiz Felipe Vecchietti, Sangkeum Lee, Dongsoo Har
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8809762/