Bidirectional Model-Based Policy Optimization Based on Adaptive Gaussian Noise and Improved Confidence Weights

Model-Based Reinforcement Learning (MBRL) has been gradually applied in the field of Robot Learning due to its excellent sample efficiency and asymptotic performance. However, for high-dimensional learning tasks in complex scenes, the exploration and stable training capabilities of the robot still n...

Full description

Bibliographic Details
Main Authors:	Wei Liu, Mengyuan Liu, Bao Jin, Yixin Zhu, Qi Gao, Jiayang Sun
Format:	Article
Language:	English
Published:	IEEE 2023-01-01
Series:	IEEE Access
Subjects:	Model-based reinforcement learning Gaussian noise confidence weight
Online Access:	https://ieeexplore.ieee.org/document/10225738/

Internet

https://ieeexplore.ieee.org/document/10225738/

Bidirectional Model-Based Policy Optimization Based on Adaptive Gaussian Noise and Improved Confidence Weights

Internet

Similar Items