QMIX: Monotonic value function factorisation for deep multi-agent reinforcement learning
In many real-world settings, a team of agents must coordinate their behaviour while acting in a decentralised way. At the same time, it is often possible to train the agents in a centralised fashion in a simulated or laboratory setting, where global state information is available and communication c...
Main Authors: | , , , , , |
---|---|
Format: | Conference item |
Published: |
Journal of Machine Learning Research
2018
|