eDA3-X: Distributed Attentional Actor Architecture for Interpretability of Coordinated Behaviors in Multi-Agent Systems

In this paper, we propose an enhanced version of the distributed attentional actor architecture (eDA3-X) for model-free reinforcement learning. This architecture is designed to facilitate the interpretability of learned coordinated behaviors in multi-agent systems through the use of a saliency vecto...

Full description

Bibliographic Details
Main Authors:	Yoshinari Motokawa, Toshiharu Sugawara
Format:	Article
Language:	English
Published:	MDPI AG 2023-07-01
Series:	Applied Sciences
Subjects:	multi-agent deep reinforcement learning explainable reinforcement learning distributed system attentional mechanism coordination cooperation
Online Access:	https://www.mdpi.com/2076-3417/13/14/8454

_version_	1797590381803601920
author	Yoshinari Motokawa Toshiharu Sugawara
author_facet	Yoshinari Motokawa Toshiharu Sugawara
author_sort	Yoshinari Motokawa
collection	DOAJ
description	In this paper, we propose an enhanced version of the distributed attentional actor architecture (eDA3-X) for model-free reinforcement learning. This architecture is designed to facilitate the interpretability of learned coordinated behaviors in multi-agent systems through the use of a saliency vector that captures partial observations of the environment. Our proposed method, in principle, can be integrated with any deep reinforcement learning method, as indicated by <i>X</i>, and can help us identify the information in input data that individual agents attend to during and after training. We then validated eDA3-X through experiments in the object collection game. We also analyzed the relationship between cooperative behaviors and three types of attention heatmaps (standard, positional, and class attentions), which provided insight into the information that the agents consider crucial when making decisions. In addition, we investigated how attention is developed by an agent through training experiences. Our experiments indicate that our approach offers a promising solution for understanding coordinated behaviors in multi-agent reinforcement learning.
first_indexed	2024-03-11T01:19:40Z
format	Article
id	doaj.art-8d9efa7a731b4ddfadbd8009cedbaa51
institution	Directory Open Access Journal
issn	2076-3417
language	English
last_indexed	2024-03-11T01:19:40Z
publishDate	2023-07-01
publisher	MDPI AG
record_format	Article
series	Applied Sciences
spelling	doaj.art-8d9efa7a731b4ddfadbd8009cedbaa512023-11-18T18:13:19ZengMDPI AGApplied Sciences2076-34172023-07-011314845410.3390/app13148454eDA3-X: Distributed Attentional Actor Architecture for Interpretability of Coordinated Behaviors in Multi-Agent SystemsYoshinari Motokawa0Toshiharu Sugawara1Department of Computer Science, Waseda University, Tokyo 169-8555, JapanDepartment of Computer Science, Waseda University, Tokyo 169-8555, JapanIn this paper, we propose an enhanced version of the distributed attentional actor architecture (eDA3-X) for model-free reinforcement learning. This architecture is designed to facilitate the interpretability of learned coordinated behaviors in multi-agent systems through the use of a saliency vector that captures partial observations of the environment. Our proposed method, in principle, can be integrated with any deep reinforcement learning method, as indicated by <i>X</i>, and can help us identify the information in input data that individual agents attend to during and after training. We then validated eDA3-X through experiments in the object collection game. We also analyzed the relationship between cooperative behaviors and three types of attention heatmaps (standard, positional, and class attentions), which provided insight into the information that the agents consider crucial when making decisions. In addition, we investigated how attention is developed by an agent through training experiences. Our experiments indicate that our approach offers a promising solution for understanding coordinated behaviors in multi-agent reinforcement learning.https://www.mdpi.com/2076-3417/13/14/8454multi-agent deep reinforcement learningexplainable reinforcement learningdistributed systemattentional mechanismcoordinationcooperation
spellingShingle	Yoshinari Motokawa Toshiharu Sugawara eDA3-X: Distributed Attentional Actor Architecture for Interpretability of Coordinated Behaviors in Multi-Agent Systems Applied Sciences multi-agent deep reinforcement learning explainable reinforcement learning distributed system attentional mechanism coordination cooperation
title	eDA3-X: Distributed Attentional Actor Architecture for Interpretability of Coordinated Behaviors in Multi-Agent Systems
title_full	eDA3-X: Distributed Attentional Actor Architecture for Interpretability of Coordinated Behaviors in Multi-Agent Systems
title_fullStr	eDA3-X: Distributed Attentional Actor Architecture for Interpretability of Coordinated Behaviors in Multi-Agent Systems
title_full_unstemmed	eDA3-X: Distributed Attentional Actor Architecture for Interpretability of Coordinated Behaviors in Multi-Agent Systems
title_short	eDA3-X: Distributed Attentional Actor Architecture for Interpretability of Coordinated Behaviors in Multi-Agent Systems
title_sort	eda3 x distributed attentional actor architecture for interpretability of coordinated behaviors in multi agent systems
topic	multi-agent deep reinforcement learning explainable reinforcement learning distributed system attentional mechanism coordination cooperation
url	https://www.mdpi.com/2076-3417/13/14/8454
work_keys_str_mv	AT yoshinarimotokawa eda3xdistributedattentionalactorarchitectureforinterpretabilityofcoordinatedbehaviorsinmultiagentsystems AT toshiharusugawara eda3xdistributedattentionalactorarchitectureforinterpretabilityofcoordinatedbehaviorsinmultiagentsystems

eDA3-X: Distributed Attentional Actor Architecture for Interpretability of Coordinated Behaviors in Multi-Agent Systems

Similar Items