Recursive Visual Explanations Mediation Scheme Based on DropAttention Model With Multiple Episodes Pool

In some DL applications such as remote sensing, it is hard to obtain the high task performance (e.g. accuracy) using the DL model on image analysis due to the low resolution characteristics of the imagery. Accordingly, several studies attempted to provide visual explanations or apply the attention m...

Full description

Bibliographic Details
Main Authors:	Minsu Jeon, Taewoo Kim, Seonghwan Kim, Changha Lee, Chan-Hyun Youn
Format:	Article
Language:	English
Published:	IEEE 2023-01-01
Series:	IEEE Access
Subjects:	Explainable AI (XAI) attention class activation map (CAM) amortized model
Online Access:	https://ieeexplore.ieee.org/document/10012347/

_version_	1797902223412297728
author	Minsu Jeon Taewoo Kim Seonghwan Kim Changha Lee Chan-Hyun Youn
author_facet	Minsu Jeon Taewoo Kim Seonghwan Kim Changha Lee Chan-Hyun Youn
author_sort	Minsu Jeon
collection	DOAJ
description	In some DL applications such as remote sensing, it is hard to obtain the high task performance (e.g. accuracy) using the DL model on image analysis due to the low resolution characteristics of the imagery. Accordingly, several studies attempted to provide visual explanations or apply the attention mechanism to enhance the reliability on the image analysis. However, there still remains structural complexity on obtaining a sophisticated visual explanation with such existing methods: 1) which layer will the visual explanation be extracted from, and 2) which layers the attention modules will be applied to. 3) Subsequently, in order to observe the aspects of visual explanations on such diverse episodes of applying attention modules individually, training cost inefficiency inevitably arises as it requires training the multiple models one by one in the conventional methods. In order to solve the problems, we propose a new scheme of mediating the visual explanations in a pixel-level recursively. Specifically, we propose DropAtt that generates multiple episodes pool by training only a single network once as an amortized model, which also shows stability on task performance regardless of layer-wise attention policy. From the multiple episodes pool generated by DropAtt, by quantitatively evaluating the explainability of each visual explanation and expanding the parts of explanations with high explainability recursively, our visual explanations mediation scheme attempts to adjust how much to reflect each episodic layer-wise explanation for enforcing a dominant explainability of each candidate. On the empirical evaluation, our methods show their feasibility on enhancing the visual explainability by reducing average drop about 17% and enhancing the rate of increase in confidence 3%.
first_indexed	2024-04-10T09:14:18Z
format	Article
id	doaj.art-7046a5535a06437d9de0c73ad352347c
institution	Directory Open Access Journal
issn	2169-3536
language	English
last_indexed	2024-04-10T09:14:18Z
publishDate	2023-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj.art-7046a5535a06437d9de0c73ad352347c2023-02-21T00:02:50ZengIEEEIEEE Access2169-35362023-01-01114306432110.1109/ACCESS.2023.323533210012347Recursive Visual Explanations Mediation Scheme Based on DropAttention Model With Multiple Episodes PoolMinsu Jeon0https://orcid.org/0000-0002-2739-8149Taewoo Kim1https://orcid.org/0000-0003-4290-6460Seonghwan Kim2https://orcid.org/0000-0002-3842-3821Changha Lee3https://orcid.org/0000-0003-3687-2989Chan-Hyun Youn4https://orcid.org/0000-0002-3970-7308School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South KoreaSchool of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South KoreaSchool of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South KoreaSchool of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South KoreaSchool of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South KoreaIn some DL applications such as remote sensing, it is hard to obtain the high task performance (e.g. accuracy) using the DL model on image analysis due to the low resolution characteristics of the imagery. Accordingly, several studies attempted to provide visual explanations or apply the attention mechanism to enhance the reliability on the image analysis. However, there still remains structural complexity on obtaining a sophisticated visual explanation with such existing methods: 1) which layer will the visual explanation be extracted from, and 2) which layers the attention modules will be applied to. 3) Subsequently, in order to observe the aspects of visual explanations on such diverse episodes of applying attention modules individually, training cost inefficiency inevitably arises as it requires training the multiple models one by one in the conventional methods. In order to solve the problems, we propose a new scheme of mediating the visual explanations in a pixel-level recursively. Specifically, we propose DropAtt that generates multiple episodes pool by training only a single network once as an amortized model, which also shows stability on task performance regardless of layer-wise attention policy. From the multiple episodes pool generated by DropAtt, by quantitatively evaluating the explainability of each visual explanation and expanding the parts of explanations with high explainability recursively, our visual explanations mediation scheme attempts to adjust how much to reflect each episodic layer-wise explanation for enforcing a dominant explainability of each candidate. On the empirical evaluation, our methods show their feasibility on enhancing the visual explainability by reducing average drop about 17% and enhancing the rate of increase in confidence 3%.https://ieeexplore.ieee.org/document/10012347/Explainable AI (XAI)attentionclass activation map (CAM)amortized model
spellingShingle	Minsu Jeon Taewoo Kim Seonghwan Kim Changha Lee Chan-Hyun Youn Recursive Visual Explanations Mediation Scheme Based on DropAttention Model With Multiple Episodes Pool IEEE Access Explainable AI (XAI) attention class activation map (CAM) amortized model
title	Recursive Visual Explanations Mediation Scheme Based on DropAttention Model With Multiple Episodes Pool
title_full	Recursive Visual Explanations Mediation Scheme Based on DropAttention Model With Multiple Episodes Pool
title_fullStr	Recursive Visual Explanations Mediation Scheme Based on DropAttention Model With Multiple Episodes Pool
title_full_unstemmed	Recursive Visual Explanations Mediation Scheme Based on DropAttention Model With Multiple Episodes Pool
title_short	Recursive Visual Explanations Mediation Scheme Based on DropAttention Model With Multiple Episodes Pool
title_sort	recursive visual explanations mediation scheme based on dropattention model with multiple episodes pool
topic	Explainable AI (XAI) attention class activation map (CAM) amortized model
url	https://ieeexplore.ieee.org/document/10012347/
work_keys_str_mv	AT minsujeon recursivevisualexplanationsmediationschemebasedondropattentionmodelwithmultipleepisodespool AT taewookim recursivevisualexplanationsmediationschemebasedondropattentionmodelwithmultipleepisodespool AT seonghwankim recursivevisualexplanationsmediationschemebasedondropattentionmodelwithmultipleepisodespool AT changhalee recursivevisualexplanationsmediationschemebasedondropattentionmodelwithmultipleepisodespool AT chanhyunyoun recursivevisualexplanationsmediationschemebasedondropattentionmodelwithmultipleepisodespool

Recursive Visual Explanations Mediation Scheme Based on DropAttention Model With Multiple Episodes Pool

Similar Items