A Multi-Agent Centralized Strategy Gradient Reinforcement Learning Algorithm Based on State Transition

The prevalent utilization of deterministic strategy algorithms in Multi-Agent Deep Reinforcement Learning (MADRL) for collaborative tasks has posed a significant challenge in achieving stable and high-performance cooperative behavior. Addressing the need for the balanced exploration and exploitation...

Full description

Bibliographic Details
Main Authors: Lei Sheng, Honghui Chen, Xiliang Chen
Format: Article
Language:English
Published: MDPI AG 2024-12-01
Series:Algorithms
Subjects:
Online Access:https://www.mdpi.com/1999-4893/17/12/579