Achieving Robustness and Generalization in MARL for Sequential Social Dilemmas through Bilinear Value Networks
This thesis presents a novel approach for training multi-agent reinforcement learning (MARL) agents that are robust to different unforeseen gameplay strategies in sequential social dilemma (SSD) games. Recent literature has demonstrated that reward shaping can not only be used to enable MARL agents...
Main Author: | Ma, Jeremy |
---|---|
Other Authors: | How, Jonathan P. |
Format: | Thesis |
Published: |
Massachusetts Institute of Technology
2023
|
Online Access: | https://hdl.handle.net/1721.1/152745 |
Similar Items
-
Robust parameter estimation method for bilinear model
by: Ismail, Mohd Isfahani, et al.
Published: (2015) -
Filtering for bilinear systems
by: Vallot, Lawrence Charles
Published: (2005) -
Marling a Regosol of Central Java and its Effect on Maize Crop Performance
by: Kertonegoro, Babang Djadmo
Published: (2000) -
Robust sequential decision-making on networks
by: Dubey, Abhimanyu.
Published: (2020) -
On the stability of bilinear stochastic systems
Published: (2003)