Counterfactual-Based Action Evaluation Algorithm in Multi-Agent Reinforcement Learning

Multi-agent reinforcement learning (MARL) algorithms have achieved strong results in many scenarios, but they still struggle with sequential social dilemmas (SSDs). In SSDs, an agent’s actions not only change the instantaneous state of the environment but also affect the latent s...

Bibliographic Details
Main Authors: Yuyu Yuan, Pengqian Zhao, Ting Guo, Hongpu Jiang
Format: Article
Language: English
Published: MDPI AG 2022-03-01
Series: Applied Sciences
Online Access: https://www.mdpi.com/2076-3417/12/7/3439