UneVEn: Universal value exploration for multi-agent reinforcement learning
VDN and QMIX are two popular value-based algorithms for cooperative MARL that learn a centralized action value function as a monotonic mixing of per-agent utilities. While this enables easy decentralization of the learned policy, the restricted joint action value function can prevent them from solvi...
Main Authors: | Gupta, T, Mahajan, A, Peng, B, Boehmer, W, Whiteson, S |
---|---|
Format: | Conference item |
Sprog: | English |
Udgivet: |
PMLR
2021
|
Lignende værker
-
Randomized entity-wise factorization for multi-agent reinforcement learning
af: Iqbal, S, et al.
Udgivet: (2021) -
Multi-agent common knowledge reinforcement learning
af: de Witt, C, et al.
Udgivet: (2019) -
MAVEN: Multi-Agent Variational Exploration
af: Mahajan, A, et al.
Udgivet: (2019) -
RODE: learning roles to decompose multi−agent tasks
af: Wang, T, et al.
Udgivet: (2021) -
Weighted QMIX: Expanding monotonic value function factorisation for deep multi−agent reinforcement learning
af: Rashid, T, et al.
Udgivet: (2020)