Reducing Q-Value Estimation Bias via Mutual Estimation and Softmax Operation in MADRL

As electronic game technology has developed, games have come to feature more units, richer unit attributes, more complex game mechanisms, and more diverse team strategies. Multi-agent deep reinforcement learning performs strongly in this type of team electronic game,...


Bibliographic Details
Main Authors:	Zheng Li, Xinkai Chen, Jiaqing Fu, Ning Xie, Tingting Zhao
Format:	Article
Language:	English
Published:	MDPI AG 2024-01-01
Series:	Algorithms
Subjects:	reinforcement learning; game AI; multi-agent Q-network; mutual estimation; softmax Bellman operation; reinforcement learning environment
Online Access:	https://www.mdpi.com/1999-4893/17/1/36

Similar Items