Anfonwch hwn fel neges destun: Language-aware vision transformer for referring segmentation