Adaptive Reward Method for End-to-End Cooperation Based on Multi-agent Reinforcement Learning

At present,most multi-agent reinforcement learning(MARL) algorithms using the architecture of centralized training and decentralized execution(CTDE) have good results in homogeneous multi-agent systems.However,for heterogeneous multi-agent systems composed of different roles,there is always the prob...

Full description

Bibliographic Details
Main Author:	SHI Dian-xi, ZHAO Chen-ran, ZHANG Yao-wen, YANG Shao-wu, ZHANG Yong-jun
Format:	Article
Language:	zho
Published:	Editorial office of Computer Science 2022-08-01
Series:	Jisuanji kexue
Subjects:	multi-agent reinforcement learning\|graph attention network\|adaptive intrinsic reward
Online Access:	https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2022-49-8-247.pdf

Internet

https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2022-49-8-247.pdf

Adaptive Reward Method for End-to-End Cooperation Based on Multi-agent Reinforcement Learning

Internet

Similar Items