Proximal policy optimization with adaptive threshold for symmetric relative density ratio

Deep reinforcement learning (DRL) is one of the promising approaches for introducing robots into complicated environments. The recent remarkable progress of DRL stands on regularization of policy, which allows the policy to improve stably and efficiently. A popular method, so-called proximal policy...

Full description

Bibliographic Details
Main Author: Taisuke Kobayashi
Format: Article
Language:English
Published: Elsevier 2023-03-01
Series:Results in Control and Optimization
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2666720722000649