HAT: A Visual Transformer Model for Image Recognition Based on Hierarchical Attention Transformation

In the field of image recognition, Visual Transformer (ViT) has excellent performance. However, ViT, relies on a fixed self-attentive layer, tends to lead to computational redundancy and makes it difficult to maintain the integrity of the image convolutional feature sequence during the training proc...

Full description

Bibliographic Details
Main Authors: Xuanyu Zhao, Tao Hu, Chunxia Mao, Ye Yuan, Jun Li
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10247525/