Learning visual prompts for guiding the attention of vision transformers
Visual prompting infuses visual information into the input image to adapt models toward specific predictions and tasks. Recently, manually crafted markers such as red circles are shown to guide the model to attend to a target region on the image. However, these markers only work on models trained wi...
Główni autorzy: | , , , , , |
---|---|
Format: | Conference item |
Język: | English |
Wydane: |
Transformers for Vision (T4V)
2024
|