Swin Routiformer: Moss Classification Algorithm Based on Swin Transformer With Bi-Level Routing Attention

Accurate classification of moss species is essential for progress in ecology and biology. However, traditional methods for classifying moss require significant expertise, and current deep learning techniques struggle due to limited dataset diversity and poor performance in multi-class classification...

Full description

Bibliographic Details
Main Authors: Peichen Li, Huiqin Wang, Zhan Wang, Ke Wang, Chong Wang
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10496696/
_version_ 1797201296183263232
author Peichen Li
Huiqin Wang
Zhan Wang
Ke Wang
Chong Wang
author_facet Peichen Li
Huiqin Wang
Zhan Wang
Ke Wang
Chong Wang
author_sort Peichen Li
collection DOAJ
description Accurate classification of moss species is essential for progress in ecology and biology. However, traditional methods for classifying moss require significant expertise, and current deep learning techniques struggle due to limited dataset diversity and poor performance in multi-class classification tasks. To overcome these challenges, we proposed the Swin Routiformer, a new algorithm for moss image classification that enhances the Swin Transformer with bi-level routing attention. Addressing the issue of limited data, we constructed a dataset with images of 110 different moss types. Additionally, we propose the Crop-Similar data augmentation algorithm, specifically designed for moss images, to reduce background noise interference and prevent information loss due to feature scaling. Adopting the Swin Transformer model with its multi-level hierarchical architecture for visual feature extraction, we introduce the Swin Routiformer Block, which enhances the network’s feature interaction capabilities, reduces computational complexity, and improves classification accuracy and image processing speed for moss species. Our experimental results show that the Swin Routiformer achieves a top-1 accuracy of 82.19% and an f1-score of 82.79% on the test set, outperforming most mainstream models by 4.53% and 1.81% respectively compared to the baseline Swin Transformer model. These findings establish the Swin Routiformer as a valuable tool for the precise identification of moss species, offering significant contributions to the related fields.
first_indexed 2024-04-24T07:45:17Z
format Article
id doaj.art-043dde3c7d9b40e695870ed89bc6f1c9
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-24T07:45:17Z
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-043dde3c7d9b40e695870ed89bc6f1c92024-04-18T23:00:48ZengIEEEIEEE Access2169-35362024-01-0112533965340710.1109/ACCESS.2024.338754110496696Swin Routiformer: Moss Classification Algorithm Based on Swin Transformer With Bi-Level Routing AttentionPeichen Li0https://orcid.org/0009-0000-5051-2168Huiqin Wang1Zhan Wang2Ke Wang3Chong Wang4School of Information and Control Engineering, Xi’an University of Architecture and Technology, Xi’an, Shaanxi, ChinaSchool of Information and Control Engineering, Xi’an University of Architecture and Technology, Xi’an, Shaanxi, ChinaShaanxi Institute for the Preservation of Cultural Heritage, Xi’an, Shaanxi, ChinaSchool of Information and Control Engineering, Xi’an University of Architecture and Technology, Xi’an, Shaanxi, ChinaShaanxi Institute for the Preservation of Cultural Heritage, Xi’an, Shaanxi, ChinaAccurate classification of moss species is essential for progress in ecology and biology. However, traditional methods for classifying moss require significant expertise, and current deep learning techniques struggle due to limited dataset diversity and poor performance in multi-class classification tasks. To overcome these challenges, we proposed the Swin Routiformer, a new algorithm for moss image classification that enhances the Swin Transformer with bi-level routing attention. Addressing the issue of limited data, we constructed a dataset with images of 110 different moss types. Additionally, we propose the Crop-Similar data augmentation algorithm, specifically designed for moss images, to reduce background noise interference and prevent information loss due to feature scaling. Adopting the Swin Transformer model with its multi-level hierarchical architecture for visual feature extraction, we introduce the Swin Routiformer Block, which enhances the network’s feature interaction capabilities, reduces computational complexity, and improves classification accuracy and image processing speed for moss species. Our experimental results show that the Swin Routiformer achieves a top-1 accuracy of 82.19% and an f1-score of 82.79% on the test set, outperforming most mainstream models by 4.53% and 1.81% respectively compared to the baseline Swin Transformer model. These findings establish the Swin Routiformer as a valuable tool for the precise identification of moss species, offering significant contributions to the related fields.https://ieeexplore.ieee.org/document/10496696/Moss classificationdeep learningattention mechanismimage classificationswin transformer
spellingShingle Peichen Li
Huiqin Wang
Zhan Wang
Ke Wang
Chong Wang
Swin Routiformer: Moss Classification Algorithm Based on Swin Transformer With Bi-Level Routing Attention
IEEE Access
Moss classification
deep learning
attention mechanism
image classification
swin transformer
title Swin Routiformer: Moss Classification Algorithm Based on Swin Transformer With Bi-Level Routing Attention
title_full Swin Routiformer: Moss Classification Algorithm Based on Swin Transformer With Bi-Level Routing Attention
title_fullStr Swin Routiformer: Moss Classification Algorithm Based on Swin Transformer With Bi-Level Routing Attention
title_full_unstemmed Swin Routiformer: Moss Classification Algorithm Based on Swin Transformer With Bi-Level Routing Attention
title_short Swin Routiformer: Moss Classification Algorithm Based on Swin Transformer With Bi-Level Routing Attention
title_sort swin routiformer moss classification algorithm based on swin transformer with bi level routing attention
topic Moss classification
deep learning
attention mechanism
image classification
swin transformer
url https://ieeexplore.ieee.org/document/10496696/
work_keys_str_mv AT peichenli swinroutiformermossclassificationalgorithmbasedonswintransformerwithbilevelroutingattention
AT huiqinwang swinroutiformermossclassificationalgorithmbasedonswintransformerwithbilevelroutingattention
AT zhanwang swinroutiformermossclassificationalgorithmbasedonswintransformerwithbilevelroutingattention
AT kewang swinroutiformermossclassificationalgorithmbasedonswintransformerwithbilevelroutingattention
AT chongwang swinroutiformermossclassificationalgorithmbasedonswintransformerwithbilevelroutingattention