Towards unified visual perception
This thesis explores the frontier of visual perception in computer vision by leveraging the capabilities of Vision Transformers (ViTs) to create a unified framework that addresses cross-task and cross-granularity challenges. Drawing inspiration from the human visual system's ability to...
Main Author: | Sun, S |
---|---|
Other Authors: | Torr, P |
Format: | Thesis |
Language: | English |
Published: | 2024 |
Similar Items
- Developing object perception in the low data regime
  by: Kaul, P
  Published: (2024)
- End-to-end learning, and audio-visual human-centric video understanding
  by: Brown, A
  Published: (2022)
- Towards diverse generation and reliable classification using neural networks
  by: Kulharia, V
  Published: (2022)
- Sign language understanding using multimodal learning
  by: Momeni, L
  Published: (2024)
- Learning with multimodal self-supervision
  by: Chen, H
  Published: (2021)