Towards unified visual perception
This thesis explores the frontier of visual perception in computer vision by leveraging the capabilities of Vision Transformers (ViTs) to create a unified framework that addresses cross-task and cross-granularity challenges. Drawing inspiration from the human visual system's ability to...
Main Author: | Sun, S |
---|---|
Other Authors: | Torr, P |
Format: | Thesis |
Language: | English |
Published: | 2024 |
Similar Items
- Developing object perception in the low data regime
  by: Kaul, P
  Published: (2024)
- End-to-end learning, and audio-visual human-centric video understanding
  by: Brown, A
  Published: (2022)
- Towards diverse generation and reliable classification using neural networks
  by: Kulharia, V
  Published: (2022)
- Sign language understanding using multimodal learning
  by: Momeni, L
  Published: (2024)
- Deep vision for indoor understanding and localisation
  by: Howard-Jenkins, H
  Published: (2022)