Breaking the deadly triad in reinforcement learning

<p>Reinforcement Learning (RL) is a promising framework for solving sequential decision making problems emerging from agent-environment interactions via trial and error. Off-policy learning is one of the most important techniques in RL, which enables an RL agent to learn from agent-environment...

Full description

Bibliographic Details
Main Author: Zhang, S
Other Authors: Whiteson, S
Format: Thesis
Language:English
Published: 2022
Subjects:

Similar Items