Learning visual prompts for guiding the attention of vision transformers

Visual prompting infuses visual information into the input image to adapt models toward specific predictions and tasks. Recently, manually crafted markers such as red circles are shown to guide the model to attend to a target region on the image. However, these markers only work on models trained wi...

Olles dieđut

Bibliográfalaš dieđut
Váldodahkkit: Rezaei, R, Sabet, MJ, Gu, J, Rueckert, D, Torr, P, Khakzar, A
Materiálatiipa: Conference item
Giella:English
Almmustuhtton: Transformers for Vision (T4V) 2024