Consistent Experience Replay in High-Dimensional Continuous Control with Decayed Hindsights

The manipulation of complex robotics, which is in general high-dimensional continuous control without an accurate dynamic model, summons studies and applications of reinforcement learning (RL) algorithms. Typically, RL learns with the objective of maximizing the accumulated rewards from interactions...

Full description

Bibliographic Details
Main Author:	Xiaoyun Feng
Format:	Article
Language:	English
Published:	MDPI AG 2022-09-01
Series:	Machines
Subjects:	robotic control goal-conditioned reinforcement learning offline reinforcement learning sparse rewards experience replay hindsight bias
Online Access:	https://www.mdpi.com/2075-1702/10/10/856

Internet

https://www.mdpi.com/2075-1702/10/10/856

Consistent Experience Replay in High-Dimensional Continuous Control with Decayed Hindsights

Internet

Similar Items