MFVT: Multilevel Feature Fusion Vision Transformer and RAMix Data Augmentation for Fine-Grained Visual Categorization
The introduction and application of the Vision Transformer (ViT) has promoted the development of fine-grained visual categorization (FGVC). However, there are some problems when directly applying ViT to FGVC tasks. ViT only classifies using the class token in the last layer, ignoring the local and l...
Main Authors: | Xinyao Lv, Hao Xia, Na Li, Xudong Li, Ruoming Lan |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2022-10-01
|
Series: | Electronics |
Subjects: | |
Online Access: | https://www.mdpi.com/2079-9292/11/21/3552 |
Similar Items
-
Weakly Supervised Semantic and Attentive Data Mixing Augmentation for Fine-Grained Visual Categorization
by: Mengqi He, et al.
Published: (2022-01-01) -
Plant Leaf Diseases Fine-Grained Categorization Using Convolutional Neural Networks
by: Yang Wu, et al.
Published: (2022-01-01) -
Adaptive Triplet Model for Fine-Grained Visual Categorization
by: Jingyun Liang, et al.
Published: (2018-01-01) -
Deep Learning Method for Fine-Grained Image Categorization
by: LI Xiangxia, JI Xiaohui, LI Bin
Published: (2021-10-01) -
Self-Supervised Collaborative Multi-Network for Fine-Grained Visual Categorization of Tomato Diseases
by: Guofeng Yang, et al.
Published: (2020-01-01)