Recursive Visual Explanations Mediation Scheme Based on DropAttention Model With Multiple Episodes Pool

In some DL applications such as remote sensing, it is hard to obtain the high task performance (e.g. accuracy) using the DL model on image analysis due to the low resolution characteristics of the imagery. Accordingly, several studies attempted to provide visual explanations or apply the attention m...

Full description

Bibliographic Details
Main Authors: Minsu Jeon, Taewoo Kim, Seonghwan Kim, Changha Lee, Chan-Hyun Youn
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10012347/
_version_ 1797902223412297728
author Minsu Jeon
Taewoo Kim
Seonghwan Kim
Changha Lee
Chan-Hyun Youn
author_facet Minsu Jeon
Taewoo Kim
Seonghwan Kim
Changha Lee
Chan-Hyun Youn
author_sort Minsu Jeon
collection DOAJ
description In some DL applications such as remote sensing, it is hard to obtain the high task performance (e.g. accuracy) using the DL model on image analysis due to the low resolution characteristics of the imagery. Accordingly, several studies attempted to provide visual explanations or apply the attention mechanism to enhance the reliability on the image analysis. However, there still remains structural complexity on obtaining a sophisticated visual explanation with such existing methods: 1) which layer will the visual explanation be extracted from, and 2) which layers the attention modules will be applied to. 3) Subsequently, in order to observe the aspects of visual explanations on such diverse episodes of applying attention modules individually, training cost inefficiency inevitably arises as it requires training the multiple models one by one in the conventional methods. In order to solve the problems, we propose a new scheme of mediating the visual explanations in a pixel-level recursively. Specifically, we propose DropAtt that generates multiple episodes pool by training only a single network once as an amortized model, which also shows stability on task performance regardless of layer-wise attention policy. From the multiple episodes pool generated by DropAtt, by quantitatively evaluating the explainability of each visual explanation and expanding the parts of explanations with high explainability recursively, our visual explanations mediation scheme attempts to adjust how much to reflect each episodic layer-wise explanation for enforcing a dominant explainability of each candidate. On the empirical evaluation, our methods show their feasibility on enhancing the visual explainability by reducing average drop about 17% and enhancing the rate of increase in confidence 3%.
first_indexed 2024-04-10T09:14:18Z
format Article
id doaj.art-7046a5535a06437d9de0c73ad352347c
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-10T09:14:18Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-7046a5535a06437d9de0c73ad352347c2023-02-21T00:02:50ZengIEEEIEEE Access2169-35362023-01-01114306432110.1109/ACCESS.2023.323533210012347Recursive Visual Explanations Mediation Scheme Based on DropAttention Model With Multiple Episodes PoolMinsu Jeon0https://orcid.org/0000-0002-2739-8149Taewoo Kim1https://orcid.org/0000-0003-4290-6460Seonghwan Kim2https://orcid.org/0000-0002-3842-3821Changha Lee3https://orcid.org/0000-0003-3687-2989Chan-Hyun Youn4https://orcid.org/0000-0002-3970-7308School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South KoreaSchool of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South KoreaSchool of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South KoreaSchool of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South KoreaSchool of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, South KoreaIn some DL applications such as remote sensing, it is hard to obtain the high task performance (e.g. accuracy) using the DL model on image analysis due to the low resolution characteristics of the imagery. Accordingly, several studies attempted to provide visual explanations or apply the attention mechanism to enhance the reliability on the image analysis. However, there still remains structural complexity on obtaining a sophisticated visual explanation with such existing methods: 1) which layer will the visual explanation be extracted from, and 2) which layers the attention modules will be applied to. 3) Subsequently, in order to observe the aspects of visual explanations on such diverse episodes of applying attention modules individually, training cost inefficiency inevitably arises as it requires training the multiple models one by one in the conventional methods. In order to solve the problems, we propose a new scheme of mediating the visual explanations in a pixel-level recursively. Specifically, we propose DropAtt that generates multiple episodes pool by training only a single network once as an amortized model, which also shows stability on task performance regardless of layer-wise attention policy. From the multiple episodes pool generated by DropAtt, by quantitatively evaluating the explainability of each visual explanation and expanding the parts of explanations with high explainability recursively, our visual explanations mediation scheme attempts to adjust how much to reflect each episodic layer-wise explanation for enforcing a dominant explainability of each candidate. On the empirical evaluation, our methods show their feasibility on enhancing the visual explainability by reducing average drop about 17% and enhancing the rate of increase in confidence 3%.https://ieeexplore.ieee.org/document/10012347/Explainable AI (XAI)attentionclass activation map (CAM)amortized model
spellingShingle Minsu Jeon
Taewoo Kim
Seonghwan Kim
Changha Lee
Chan-Hyun Youn
Recursive Visual Explanations Mediation Scheme Based on DropAttention Model With Multiple Episodes Pool
IEEE Access
Explainable AI (XAI)
attention
class activation map (CAM)
amortized model
title Recursive Visual Explanations Mediation Scheme Based on DropAttention Model With Multiple Episodes Pool
title_full Recursive Visual Explanations Mediation Scheme Based on DropAttention Model With Multiple Episodes Pool
title_fullStr Recursive Visual Explanations Mediation Scheme Based on DropAttention Model With Multiple Episodes Pool
title_full_unstemmed Recursive Visual Explanations Mediation Scheme Based on DropAttention Model With Multiple Episodes Pool
title_short Recursive Visual Explanations Mediation Scheme Based on DropAttention Model With Multiple Episodes Pool
title_sort recursive visual explanations mediation scheme based on dropattention model with multiple episodes pool
topic Explainable AI (XAI)
attention
class activation map (CAM)
amortized model
url https://ieeexplore.ieee.org/document/10012347/
work_keys_str_mv AT minsujeon recursivevisualexplanationsmediationschemebasedondropattentionmodelwithmultipleepisodespool
AT taewookim recursivevisualexplanationsmediationschemebasedondropattentionmodelwithmultipleepisodespool
AT seonghwankim recursivevisualexplanationsmediationschemebasedondropattentionmodelwithmultipleepisodespool
AT changhalee recursivevisualexplanationsmediationschemebasedondropattentionmodelwithmultipleepisodespool
AT chanhyunyoun recursivevisualexplanationsmediationschemebasedondropattentionmodelwithmultipleepisodespool