Deep Deterministic Policy Gradient with Episode Experience Replay
The research on continuous control in reinforcement learning has been a hot topic in recent years.The deep deterministic policy gradient (DDPG) algorithm performs well in continuous control tasks.DDPG algorithm uses experience replay mechanism to train the network model,and in order to further impro...
Main Author: | |
---|---|
Format: | Article |
Language: | zho |
Published: |
Editorial office of Computer Science
2021-10-01
|
Series: | Jisuanji kexue |
Subjects: | |
Online Access: | http://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2021-10-37.pdf |