A Functional Clipping Approach for Policy Optimization Algorithms

Proximal policy optimization (PPO) has yielded state-of-the-art results in policy search, a subfield of reinforcement learning, with one of its key points being the use of a surrogate objective function to restrict the step size at each policy update. Although such restriction is helpful, the algori...

Full description

Bibliographic Details
Main Authors:	Wangshu Zhu, Andre Rosendo
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Access
Subjects:	Machine learning robot control deep reinforcement learning policy search algorithm
Online Access:	https://ieeexplore.ieee.org/document/9474478/

Internet

https://ieeexplore.ieee.org/document/9474478/

A Functional Clipping Approach for Policy Optimization Algorithms

Internet

Similar Items