RT-ViT: Real-Time Monocular Depth Estimation Using Lightweight Vision Transformers

The latest research in computer vision highlighted the effectiveness of the vision transformers (ViT) in performing several computer vision tasks; they can efficiently understand and process the image globally unlike the convolution which processes the image locally. ViTs outperform the convolutiona...

Full description

Bibliographic Details
Main Authors: Hatem Ibrahem, Ahmed Salem, Hyun-Soo Kang
Format: Article
Language:English
Published: MDPI AG 2022-05-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/22/10/3849