Learning visual prompts for guiding the attention of vision transformers

Visual prompting infuses visual information into the input image to adapt models toward specific predictions and tasks. Recently, manually crafted markers such as red circles have been shown to guide the model to attend to a target region in the image. However, these markers only work on models trained wi...

Bibliographic details
Main authors: Rezaei, R, Sabet, MJ, Gu, J, Rueckert, D, Torr, P, Khakzar, A
Format: Conference item
Language: English
Published: Transformers for Vision (T4V) 2024