Now you're speaking my language: visual language identification

The goal of this work is to train models that can identify a spoken language just by interpreting the speaker’s lip movements. Our contributions are the following: (i) we show that models can learn to discriminate among 14 different languages using only visual speech information; (ii) we compare dif...

תיאור מלא

מידע ביבליוגרפי
Main Authors: Afouras, T, Chung, JS, Zisserman, A
פורמט: Conference item
שפה:English
יצא לאור: ISCA Archive 2020