Pošalji tekstualnu poruku: Language-aware vision transformer for referring segmentation