Comparing Vision Transformers and Convolutional Neural Networks for Image Classification: A Literature Review
Transformers are models that implement a mechanism of self-attention, individually weighting the importance of each part of the input data. Their use in image classification tasks is still somewhat limited since researchers have so far chosen Convolutional Neural Networks for image classification an...
Main Authors: | José Maurício, Inês Domingues, Jorge Bernardino |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-04-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/13/9/5521 |
Similar Items
-
Enhancing Cervical Pre-Cancerous Classification Using Advanced Vision Transformer
by: Manal Darwish, et al.
Published: (2023-09-01) -
HGR-ViT: Hand Gesture Recognition with Vision Transformer
by: Chun Keat Tan, et al.
Published: (2023-06-01) -
Ultrasound Image Analysis with Vision Transformers—Review
by: Majid Vafaeezadeh, et al.
Published: (2024-03-01) -
Enhancing Fashion Classification with Vision Transformer (ViT) and Developing Recommendation Fashion Systems Using DINOVA2
by: Hadeer M. Abd Alaziz, et al.
Published: (2023-10-01) -
FibroVit—Vision transformer-based framework for detection and classification of pulmonary fibrosis from chest CT images
by: Muhammad Waseem Sabir, et al.
Published: (2023-11-01)