Proximal Policy Optimization Based on Self-directed Action Selection

Proximal Policy Optimization Based on Self-directed Action Selection

The optimization algorithm of monotonous improvement of strategy in reinforcement learning is a current research hotspot,and it has achieved good performance in both discrete and continuous control tasks.Proximal policy optimization(PPO)algorithm is a classic strategy monotonic promotion algorithm,b...

Full description

Bibliographic Details
Main Author:	SHEN Yi, LIU Quan
Format:	Article
Language:	zho
Published:	Editorial office of Computer Science 2021-12-01
Series:	Jisuanji kexue
Subjects:	reinforcement learning\|deep reinforcement learning\|policy gradient\|proximal policy optimization\|self-directed
Online Access:	https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2021-12-297.pdf

Similar Items

Relative Entropy of Correct Proximal Policy Optimization Algorithms with Modified Penalty Factor in Complex Environment
by: Weimin Chen, et al.
Published: (2022-03-01)

Intelligent Design of Hairpin Filters Based on Artificial Neural Network and Proximal Policy Optimization
by: Yunong Ye, et al.
Published: (2023-08-01)

An Improved Proximal Policy Optimization Method for Low-Level Control of a Quadrotor
by: Wentao Xue, et al.
Published: (2022-04-01)

An Empirical Investigation of Early Stopping Optimizations in Proximal Policy Optimization
by: Rousslan Fernand Julien Dossa, et al.
Published: (2021-01-01)

A new approach for drone tracking with drone using Proximal Policy Optimization based distributed deep reinforcement learning
by: Ziya Tan, et al.
Published: (2023-07-01)

An Object Recognition Grasping Approach Using Proximal Policy Optimization With YOLOv5
by: Qingchun Zheng, et al.
Published: (2023-01-01)

Exploring the Use of Invalid Action Masking in Reinforcement Learning: A Comparative Study of On-Policy and Off-Policy Algorithms in Real-Time Strategy Games
by: Yueqi Hou, et al.
Published: (2023-07-01)

Process control of mAb production using multi-actor proximal policy optimization
by: Nikita Gupta, et al.
Published: (2023-09-01)

An Enhanced Proximal Policy Optimization-Based Reinforcement Learning Method with Random Forest for Hyperparameter Optimization
by: Zhixin Ma, et al.
Published: (2022-07-01)

Optimal Control Algorithm for Subway Train Operation by Proximal Policy Optimization
by: Bin Chen, et al.
Published: (2023-06-01)

Proximal policy optimization with adaptive threshold for symmetric relative density ratio
by: Taisuke Kobayashi
Published: (2023-03-01)

An AGC Dynamic Optimization Method Based on Proximal Policy Optimization
by: Zhao Liu, et al.
Published: (2022-07-01)

Adaptive Supply Chain: Demand–Supply Synchronization Using Deep Reinforcement Learning
by: Zhandos Kegenbekov, et al.
Published: (2021-08-01)

An Empirical Study of DDPG and PPO-Based Reinforcement Learning Algorithms for Autonomous Driving
by: Sanjna Siboo, et al.
Published: (2023-01-01)

Exploring Parameter Space in Reinforcement Learning
by: Rückstieß Thomas, et al.
Published: (2010-03-01)

Optimal economic dispatch of a virtual power plant based on gated recurrent unit proximal policy optimization
by: Zhiping Gao, et al.
Published: (2024-02-01)

Deep Reinforcement Learning Based on Proximal Policy Optimization for the Maintenance of a Wind Farm with Multiple Crews
by: Luca Pinciroli, et al.
Published: (2021-10-01)

ROBB: Recurrent Proximal Policy Optimization Reinforcement Learning for Optimal Block Formation in Bitcoin Blockchain Network
by: Amit Dutta, et al.
Published: (2024-01-01)

Robotic-Arm-Based Force Control by Deep Deterministic Policy Gradient in Neurosurgical Practice
by: Ibai Inziarte-Hidalgo, et al.
Published: (2023-09-01)

Enhanced Deep Deterministic Policy Gradient Algorithm Using Grey Wolf Optimizer for Continuous Control Tasks
by: Ebrahim Hamid Hasan Sumiea, et al.
Published: (2023-01-01)

Deep deterministic policy gradient and graph convolutional network for bracing direction optimization of grid shells
by: Chi-tathon Kupwiwat, et al.
Published: (2022-08-01)

Parallel Bootstrap-Based On-Policy Deep Reinforcement Learning for Continuous Fluid Flow Control Applications
by: Jonathan Viquerat, et al.
Published: (2023-07-01)

Dandelion Optimizer-Based Reinforcement Learning Techniques for MPPT of Grid- Connected Photovoltaic Systems
by: Ghazi A. Ghazi, et al.
Published: (2024-01-01)

Cooperative Control for Multi-Intersection Traffic Signal Based on Deep Reinforcement Learning and Imitation Learning
by: Yusen Huo, et al.
Published: (2020-01-01)

Toward Self-Driving Bicycles Using State-of-the-Art Deep Reinforcement Learning Algorithms
by: SeungYoon Choi, et al.
Published: (2019-02-01)

Comparison of On-Policy Deep Reinforcement Learning A2C with Off-Policy DQN in Irrigation Optimization: A Case Study at a Site in Portugal
by: Khadijeh Alibabaei, et al.
Published: (2022-06-01)

Proximal Policy Optimization Through a Deep Reinforcement Learning Framework for Multiple Autonomous Vehicles at a Non-Signalized Intersection
by: Duy Quang Tran, et al.
Published: (2020-08-01)

DDRCN: Deep Deterministic Policy Gradient Recommendation Framework Fused with Deep Cross Networks
by: Tianhan Gao, et al.
Published: (2023-02-01)

Proximal Policy Optimization-Based Reinforcement Learning and Hybrid Approaches to Explore the Cross Array Task Optimal Solution
by: Samuel Corecco, et al.
Published: (2023-11-01)

The Temperature Prediction of Permanent Magnet Synchronous Machines Based on Proximal Policy Optimization
by: Yuefeng Cen, et al.
Published: (2020-10-01)

Active Exploration Deep Reinforcement Learning for Continuous Action Space with Forward Prediction
by: Dongfang Zhao, et al.
Published: (2024-01-01)

Research on air combat decision algorithm based on proximal policy optimization
by: ZHANG Bochao, et al.
Published: (2023-04-01)

Analysis of Mobile Robot Control by Reinforcement Learning Algorithm
by: Jakub Bernat, et al.
Published: (2022-05-01)

Comparative Study of Cooperative Platoon Merging Control Based on Reinforcement Learning
by: Ali Irshayyid, et al.
Published: (2023-01-01)

Impact-Angle Constraint Guidance and Control Strategies Based on Deep Reinforcement Learning
by: Junfang Fan, et al.
Published: (2023-11-01)

Time-Sensitive and Resource-Aware Concurrent Workflow Scheduling for Edge Computing Platforms Based on Deep Reinforcement Learning
by: Jiaming Zhang, et al.
Published: (2023-09-01)

Real-time security margin control using deep reinforcement learning
by: Hannes Hagmar, et al.
Published: (2023-07-01)

Adaptive Data Collection and Offloading in Multi-UAV-Assisted Maritime IoT Systems: A Deep Reinforcement Learning Approach
by: Ziyi Liang, et al.
Published: (2023-01-01)

Research on Data-Driven Optimal Scheduling of Power System
by: Jianxun Luo, et al.
Published: (2023-03-01)

Generative Adversarial Inverse Reinforcement Learning With Deep Deterministic Policy Gradient
by: Ming Zhan, et al.
Published: (2023-01-01)