Deep Deterministic Policy Gradient with Episode Experience Replay

The research on continuous control in reinforcement learning has been a hot topic in recent years.The deep deterministic policy gradient (DDPG) algorithm performs well in continuous control tasks.DDPG algorithm uses experience replay mechanism to train the network model,and in order to further impro...

Full description

Bibliographic Details
Main Author:	ZHANG Jian-hang, LIU Quan
Format:	Article
Language:	zho
Published:	Editorial office of Computer Science 2021-10-01
Series:	Jisuanji kexue
Subjects:	deep deterministic policy gradient\|continuous control tasks\|experience replay\|cumulative reward\|classifying experience replay
Online Access:	http://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2021-10-37.pdf

Internet

http://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2021-10-37.pdf

Deep Deterministic Policy Gradient with Episode Experience Replay

Internet

Similar Items