Achieving Robustness and Generalization in MARL
for Sequential Social Dilemmas through Bilinear
Value Networks

Achieving Robustness and Generalization in MARL for Sequential Social Dilemmas through Bilinear Value Networks

This thesis presents a novel approach for training multi-agent reinforcement learning (MARL) agents that are robust to different unforeseen gameplay strategies in sequential social dilemma (SSD) games. Recent literature has demonstrated that reward shaping can not only be used to enable MARL agents...

Full description

Bibliographic Details
Main Author:	Ma, Jeremy
Other Authors:	How, Jonathan P.
Format:	Thesis
Published:	Massachusetts Institute of Technology 2023
Online Access:	https://hdl.handle.net/1721.1/152745

Similar Items

Robust parameter estimation method for bilinear model
by: Ismail, Mohd Isfahani, et al.
Published: (2015)

Filtering for bilinear systems
by: Vallot, Lawrence Charles
Published: (2005)

Marling a Regosol of Central Java and its Effect on Maize Crop Performance
by: Kertonegoro, Babang Djadmo
Published: (2000)

Robust sequential decision-making on networks
by: Dubey, Abhimanyu.
Published: (2020)

On the stability of bilinear stochastic systems
Published: (2003)

Signal approximation using the bilinear transform
by: Venkataraman, Archana, Ph. D. Massachusetts Institute of Technology
Published: (2009)

Input classes for identifiability of bilinear systems
by: Sontag, Eduardo D., et al.
Published: (2011)

FlightMARL: A Multi-Agent Reinforcement Learning Framework for Vision-Based Control of Autonomous Quadrotors
by: Shubert, Ryan
Published: (2022)

Outlier evaluation for the bilinear time series model.
by: Mohamed, I.B., et al.
Published: (2008)

Nonlinearity Tests For Bilinear Time Series Data
by: Mohamed, Ibrahim, et al.
Published: (2005)

Semigrup yang dikonstruksikan dari bentuk bilinear
by: , KARYATI, et al.
Published: (2002)

Statistical zapr arguments from bilinear maps
Published: (2021)

Statistical zapr arguments from bilinear maps
by: Lombardi, Alex, et al.
Published: (2022)

Matrix algorithms for bilinear estimation problems in chemometrics
by: Kim, Ryan Royce.
Published: (2022)

On a bilinear Strichartz estimate on irrational tori
by: Fan, Chenjie, et al.
Published: (2018)

Representation Learning for Extrapolation via Bilinear Transduction
by: Spiride, Andrei
Published: (2024)

Lapuran Persidangan: EGIS/MARl '94, La Defense Paris, France 29 March - 1 April, 1994
by: Abdul Rahman, Alias
Published: (1994)

The social dilemma of autonomous vehicles
by: Bonnefon, J-F, et al.
Published: (2021)

The social dilemma of autonomous vehicles
by: Bonnefon, Jean-François, et al.
Published: (2021)

Generalized precedent logics for resolving insecurity dilemmas
by: Alker, Hayward R., et al.
Published: (2014)

Implicit coalitions in a generalized prisoners's dilemma
by: Fader, Peter S., et al.
Published: (2009)

An improved bilinear restriction estimate for the paraboloid in R3
by: Oh, Changkeun
Published: (2023)

Analysis of Bilinear Distillation Column Using Tubular Model
by: Ibrahim, Norazlin
Published: (2004)

Dynamic analysis of adobe structure with bilinear material modelling
by: Lye, Jiun Yin
Published: (2018)

Seismic response of segmental buildings with bilinear isolation systems
by: Zhu, Zhuo Fei.
Published: (2010)

Achieving the Holevo bound via sequential measurements
by: Giovannetti, Vittorio, et al.
Published: (2012)

Social change and social contradictions: The China dilemma
by: Yeoh, Emile Kok Kheng
Published: (2012)

Robust and Adaptive Sequential Submodular Optimization
by: Tzoumas, V, et al.
Published: (2023)

Mitigating Generative Agent Social Dilemmas
by: Yocum, Julian R.
Published: (2024)

Advances in signatures, encryption, and E-Cash from bilinear groups
by: Hohenberger, Susan Rae, 1978-
Published: (2007)

Probabilistic derivation of a bilinear summation formula for the Meixner-Pollaczek polynominals
by: Lee, P.A.
Published: (1980)

Bilinear pairings computation using the extended double-base chains algorithm
by: Mohammed Ismail, Abdulwahed, et al.
Published: (2010)

An integer linear programming approach for a class of bilinear integer programs
by: Hu, Wuhua, et al.
Published: (2014)

The use of bilinearly weighted cross sections for few-group transient analysis
by: Kim, Myung Hyun
Published: (2005)

Violation of Moral Values as Seen in Shawâ��s The Doctorâ��s Dilemma
by: , AMARYLIANI SUKMA GUSTARIANA, et al.
Published: (2014)

Social traps and ethical dilemmas: an Islamic perspective
by: Fontaine, Rodrigue
Published: (2010)

Effacing the Dilemma of the Rumoring Subject: A Value-Oriented Approach towards Studying Misinformation on Social Media
by: Aricat, Rajiv
Published: (2017)

The Malay Muslim dilemma in Malaysia after the 12th general election
by: Nor, M.R.M., et al.
Published: (2013)

Achieving competitive advantage through service quality at Singapore general hospital.
by: Asha Germurkh Aswani, et al.
Published: (2014)

Bilinear-pairing-based remote user authentication schemes using smart cards
by: Pathan, Al-Sakib Khan, et al.
Published: (2009)