Deep residual reinforcement learning
<p>We revisit residual algorithms in both model-free and model-based reinforcement learning settings. We propose the bidirectional target network technique to stabilize residual algorithms, yielding a residual version of DDPG that significantly outperforms vanilla DDPG in the DeepMind Control...
Hlavní autoři: | , , |
---|---|
Médium: | Conference item |
Jazyk: | English |
Vydáno: |
International Foundation for Autonomous Agents and Multiagent Systems
2020
|