Learning visual prompts for guiding the attention of vision transformers

Visual prompting infuses visual information into the input image to adapt models toward specific predictions and tasks. Recently, manually crafted markers such as red circles have been shown to guide the model to attend to a target region in the image. However, these markers only work on models trained wi...

Bibliographic details
Main authors: Rezaei, R, Sabet, MJ, Gu, J, Rueckert, D, Torr, P, Khakzar, A
Item type: Conference item
Language: English
Published: Transformers for Vision (T4V) 2024