Stable opponent shaping in differentiable games
A growing number of learning methods are actually differentiable games whose players optimise multiple, interdependent objectives in parallel – from GANs and intrinsic curiosity to multi-agent RL. Opponent shaping is a powerful approach to improve learning dynamics in these games, accounting for pla...
Main Authors: | , , , , |
---|---|
Format: | Conference item |
Published: |
OpenReview
2019
|