Deep audio-visual speech recognition
The goal of this work is to recognise phrases and sentences being spoken by a talking face, with or without the audio. Unlike previous works that have focussed on recognising a limited number of words or phrases, we tackle lip reading as an open-world problem -- unconstrained natural language senten...
Main Authors: | , , , , |
---|---|
Format: | Journal article |
Language: | English |
Published: |
IEEE
2018
|