Deep audio-visual speech recognition

The goal of this work is to recognise phrases and sentences being spoken by a talking face, with or without the audio. Unlike previous works that have focussed on recognising a limited number of words or phrases, we tackle lip reading as an open-world problem -- unconstrained natural language senten...

Bibliographic Details
Main Authors: Afouras, T; Chung, J; Senior, A; Vinyals, O; Zisserman, A
Format: Journal article
Language: English
Published: IEEE 2018