Achieving Robustness and Generalization in MARL for Sequential Social Dilemmas through Bilinear Value Networks

This thesis presents a novel approach for training multi-agent reinforcement learning (MARL) agents that are robust to different unforeseen gameplay strategies in sequential social dilemma (SSD) games. Recent literature has demonstrated that reward shaping can not only be used to enable MARL agents...

Full description

Bibliographic Details
Main Author: Ma, Jeremy
Other Authors: How, Jonathan P.
Format: Thesis
Published: Massachusetts Institute of Technology 2023
Online Access:https://hdl.handle.net/1721.1/152745