Visual grounding in video for unsupervised word translation

There are thousands of actively spoken languages on Earth, but a single visual world. Grounding in this visual world has the potential to bridge the gap between all these languages. Our goal is to use visual grounding to improve unsupervised word mapping between languages. The key idea is to establi...

Full description

Bibliographic Details
Main Authors: Sigurdsson, GA, Alayrac, JB, Nematzadeh, A, Smaira, L, Malinowski, M, Carreira, J, Blunsom, P, Zisserman, A
Format: Journal article
Language:English
Published: IEEE 2020