FACMAC: Factored multi−agent centralised policy gradients
We propose FACtored Multi-Agent Centralised policy gradients (FACMAC), a new method for cooperative multi-agent reinforcement learning in both discrete and continuous action spaces. Like MADDPG, a popular multi-agent actor-critic method, our approach uses deep deterministic policy gradients to learn...
Main Authors: | , , , , , , |
---|---|
Format: | Conference item |
Language: | English |
Published: |
NeurIPS
2022
|