Deep Deterministic Policy Gradient with Episode Experience Replay

Continuous control in reinforcement learning has been a hot research topic in recent years. The deep deterministic policy gradient (DDPG) algorithm performs well in continuous control tasks. DDPG uses an experience replay mechanism to train the network model, and in order to further impro...

Bibliographic Details
Main Author: ZHANG Jian-hang, LIU Quan
Format: Article
Language: Chinese (zho)
Published: Editorial office of Computer Science 2021-10-01
Series: Jisuanji kexue
Online Access: http://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2021-10-37.pdf