Sub-word level lip reading with visual attention

The goal of this paper is to learn strong lip reading models that can recognise speech in silent videos. Most prior works deal with the open-set visual speech recognition problem by adapting existing automatic speech recognition techniques on top of trivially pooled visual features. Instead, in this...


Bibliographic details
Main authors: Prajwal, KR; Afouras, T; Zisserman, A
Format: Conference item
Language: English
Published: IEEE 2022
