Proximal Policy Optimization Based on Self-directed Action Selection
The optimization algorithm of monotonous improvement of strategy in reinforcement learning is a current research hotspot,and it has achieved good performance in both discrete and continuous control tasks.Proximal policy optimization(PPO)algorithm is a classic strategy monotonic promotion algorithm,b...
Main Author: | SHEN Yi, LIU Quan |
---|---|
Format: | Article |
Language: | zho |
Published: |
Editorial office of Computer Science
2021-12-01
|
Series: | Jisuanji kexue |
Subjects: | |
Online Access: | https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2021-12-297.pdf |
Similar Items
-
Relative Entropy of Correct Proximal Policy Optimization Algorithms with Modified Penalty Factor in Complex Environment
by: Weimin Chen, et al.
Published: (2022-03-01) -
Intelligent Design of Hairpin Filters Based on Artificial Neural Network and Proximal Policy Optimization
by: Yunong Ye, et al.
Published: (2023-08-01) -
An Improved Proximal Policy Optimization Method for Low-Level Control of a Quadrotor
by: Wentao Xue, et al.
Published: (2022-04-01) -
An Empirical Investigation of Early Stopping Optimizations in Proximal Policy Optimization
by: Rousslan Fernand Julien Dossa, et al.
Published: (2021-01-01) -
A new approach for drone tracking with drone using Proximal Policy Optimization based distributed deep reinforcement learning
by: Ziya Tan, et al.
Published: (2023-07-01)