Adaptive Reward Method for End-to-End Cooperation Based on Multi-agent Reinforcement Learning

At present,most multi-agent reinforcement learning(MARL) algorithms using the architecture of centralized training and decentralized execution(CTDE) have good results in homogeneous multi-agent systems.However,for heterogeneous multi-agent systems composed of different roles,there is always the prob...

Full description

Bibliographic Details
Main Author: SHI Dian-xi, ZHAO Chen-ran, ZHANG Yao-wen, YANG Shao-wu, ZHANG Yong-jun
Format: Article
Language:zho
Published: Editorial office of Computer Science 2022-08-01
Series:Jisuanji kexue
Subjects:
Online Access:https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2022-49-8-247.pdf