Semantics-aware dynamic localization and refinement for referring image segmentation
Referring image segmentation segments an image from a language expression. With the aim of producing high-quality masks, existing methods often adopt iterative learning approaches that rely on RNNs or stacked attention layers to refine vision-language features. Despite their complexity, RNN-based me...
Päätekijät: | Yang, Z, Wang, J, Tang, Y, Chen, K, Zhao, H, Torr, PHS |
---|---|
Aineistotyyppi: | Conference item |
Kieli: | English |
Julkaistu: |
AAAI Conference on Artificial Intelligence
2023
|
Samankaltaisia teoksia
-
LAVT: Language-Aware Vision Transformer for referring image segmentation
Tekijä: Yang, Z, et al.
Julkaistu: (2022) -
Language-aware vision transformer for referring segmentation
Tekijä: Yang, Z, et al.
Julkaistu: (2024) -
Scalable cascade inference for semantic image segmentation
Tekijä: Sturgess, P, et al.
Julkaistu: (2012) -
Hierarchical interaction network for video object segmentation from referring expressions
Tekijä: Yang, Z, et al.
Julkaistu: (2021) -
Local and blobal GANs with semantic-aware upsampling for image generation
Tekijä: Tang, H, et al.
Julkaistu: (2022)