A parallel heterogeneous policy deep reinforcement learning algorithm for bipedal walking motion design
Considering the dynamics and non-linear characteristics of biped robots, gait optimization is an extremely challenging task. To tackle this issue, a parallel heterogeneous policy Deep Reinforcement Learning (DRL) algorithm for gait optimization is proposed. Firstly, the Deep Deterministic Policy Gra...
Hlavní autoři: | , , |
---|---|
Médium: | Článek |
Jazyk: | English |
Vydáno: |
Frontiers Media S.A.
2023-08-01
|
Edice: | Frontiers in Neurorobotics |
Témata: | |
On-line přístup: | https://www.frontiersin.org/articles/10.3389/fnbot.2023.1205775/full |