Breaking the deadly triad in reinforcement learning
<p>Reinforcement Learning (RL) is a promising framework for solving sequential decision making problems emerging from agent-environment interactions via trial and error. Off-policy learning is one of the most important techniques in RL, which enables an RL agent to learn from agent-environment...
Main Author: | Zhang, S |
---|---|
Other Authors: | Whiteson, S |
Format: | Thesis |
Language: | English |
Published: |
2022
|
Subjects: |
Similar Items
-
Breaking the deadly triad with a target network
by: Zhang, S, et al.
Published: (2021) -
Learning automata : an introduction /
by: 182643 Narendra, Kumpati S., et al.
Published: (1989) -
Mission-ready Reinforcement Learning
Published: (2020) -
Learning and Invariance in a Family of Hierarchical Kernels
by: Wibisono, Andre, et al.
Published: (2010) -
Combine and conquer: representation learning from multmodal data
by: Shi, Y
Published: (2023)