Semantics-aware dynamic localization and refinement for referring image segmentation
Referring image segmentation segments an image from a language expression. With the aim of producing high-quality masks, existing methods often adopt iterative learning approaches that rely on RNNs or stacked attention layers to refine vision-language features. Despite their complexity, RNN-based me...
主要な著者: | Yang, Z, Wang, J, Tang, Y, Chen, K, Zhao, H, Torr, PHS |
---|---|
フォーマット: | Conference item |
言語: | English |
出版事項: |
AAAI Conference on Artificial Intelligence
2023
|
類似資料
-
LAVT: Language-Aware Vision Transformer for referring image segmentation
著者:: Yang, Z, 等
出版事項: (2022) -
Language-aware vision transformer for referring segmentation
著者:: Yang, Z, 等
出版事項: (2024) -
Scalable cascade inference for semantic image segmentation
著者:: Sturgess, P, 等
出版事項: (2012) -
Hierarchical interaction network for video object segmentation from referring expressions
著者:: Yang, Z, 等
出版事項: (2021) -
Local and blobal GANs with semantic-aware upsampling for image generation
著者:: Tang, H, 等
出版事項: (2022)