Semantics-aware dynamic localization and refinement for referring image segmentation
Referring image segmentation segments an image from a language expression. With the aim of producing high-quality masks, existing methods often adopt iterative learning approaches that rely on RNNs or stacked attention layers to refine vision-language features. Despite their complexity, RNN-based me...
Main Authors: | Yang, Z, Wang, J, Tang, Y, Chen, K, Zhao, H, Torr, PHS |
---|---|
格式: | Conference item |
语言: | English |
出版: |
AAAI Conference on Artificial Intelligence
2023
|
相似书籍
-
LAVT: Language-Aware Vision Transformer for referring image segmentation
由: Yang, Z, et al.
出版: (2022) -
Language-aware vision transformer for referring segmentation
由: Yang, Z, et al.
出版: (2024) -
Hierarchical interaction network for video object segmentation from referring expressions
由: Yang, Z, et al.
出版: (2021) -
Scalable cascade inference for semantic image segmentation
由: Sturgess, P, et al.
出版: (2012) -
Local and blobal GANs with semantic-aware upsampling for image generation
由: Tang, H, et al.
出版: (2022)