A Multi-Task Vision Transformer for Segmentation and Monocular Depth Estimation for Autonomous Vehicles

In this paper, we investigate the use of Vision Transformers for processing and understanding visual data in an autonomous driving setting. Specifically, we explore the use of Vision Transformers for semantic segmentation and monocular depth estimation using only a single image as input. We present...

Full description

Bibliographic Details
Main Authors: Durga Prasad Bavirisetti, Herman Ryen Martinsen, Gabriel Hanssen Kiss, Frank Lindseth
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Open Journal of Intelligent Transportation Systems
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10330677/