Stabilising experience replay for deep multi-agent reinforcement learning
Many real-world problems, such as network packet routing and urban traffic control, are naturally modeled as multi-agent reinforcement learning (RL) problems. However, existing multi-agent RL methods typically scale poorly in the problem size. Therefore, a key challenge is to translate the success o...
Автори: | , , , , , |
---|---|
Формат: | Conference item |
Опубліковано: |
PMLR
2017
|