QMIX: Monotonic value function factorisation for deep multi-agent reinforcement learning

In many real-world settings, a team of agents must coordinate their behaviour while acting in a decentralised way. At the same time, it is often possible to train the agents in a centralised fashion in a simulated or laboratory setting, where global state information is available and communication c...

Full description

Bibliographic Details
Main Authors: Rashid, T, Samvelyan, M, Schroeder de Witt, C, Farquhar, G, Foerster, J, Whiteson, S
Format: Conference item
Published: Journal of Machine Learning Research 2018