SE‐Swin: An improved Swin‐Transfomer network of self‐ensemble feature extraction framework for image retrieval

Abstract: The Swin‐Transformer is a variant of the Vision Transformer, which constructs a hierarchical Transformer that computes representations with shifted windows and window multi‐head self‐attention. This method can handle the scale invariance problem and performs well in many computer vision tas...

Bibliographic Details
Main Authors: Yixuan Xu, Xianbing Wang, Hua Zhang, Hai Lin
Format: Article
Language: English
Published: Wiley 2024-01-01
Series: IET Image Processing
Online Access: https://doi.org/10.1049/ipr2.12929