Deep audio-visual speech recognition

The goal of this work is to recognise phrases and sentences being spoken by a talking face, with or without the audio. Unlike previous works that have focussed on recognising a limited number of words or phrases, we tackle lip reading as an open-world problem -- unconstrained natural language senten...

Full description

Bibliographic Details
Main Authors:	Afouras, T, Chung, J, Senior, A, Vinyals, O, Zisserman, A
Format:	Journal article
Language:	English
Published:	IEEE 2018

Deep audio-visual speech recognition

Similar Items