Learning to lip read words by watching videos

<p>Our aim is to recognise the words being spoken by a talking face, given only the video but not the audio. Existing works in this area have focussed on trying to recognise a small number of utterances in controlled environments (e.g. digits and alphabets), partially due to the shortage of su...

Szczegółowa specyfikacja

Opis bibliograficzny
Główni autorzy: Chung, J, Zisserman, A
Format: Journal article
Wydane: Elsevier 2018