Weighted QMIX: Expanding monotonic value function factorisation for deep multi−agent reinforcement learning
QMIX is a popular Q-learning algorithm for cooperative MARL in the centralised training and decentralised execution paradigm. In order to enable easy decentralisation, QMIX restricts the joint action Q-values it can represent to be a monotonic mixing of each agent’s utilities. However, this restrict...
Main Authors: | , , , |
---|---|
Format: | Conference item |
Language: | English |
Published: |
NeurIPS
2020
|