QMIX: Monotonic value function factorisation for deep multi-agent reinforcement learning
In many real-world settings, a team of agents must coordinate their behaviour while acting in a decentralised way. At the same time, it is often possible to train the agents in a centralised fashion in a simulated or laboratory setting, where global state information is available and communication c...
Main authors: | , , , , , |
---|---|
Format: | Conference item |
Published in: | Journal of Machine Learning Research, 2018 |