Monotonic value function factorisation for deep multi-agent reinforcement learning
In many real-world settings, a team of agents must coordinate its behaviour while acting in a decentralised fashion. At the same time, it is often possible to train the agents in a centralised fashion where global state information is available and communication constraints are lifted. Learning join...
मुख्य लेखकों: | , , , , , |
---|---|
स्वरूप: | Journal article |
भाषा: | English |
प्रकाशित: |
Journal of Machine Learning Research
2020
|