An optimal 3D convolutional neural network based lipreading method

Abstract Lipreading is a visual recognition of speech by using lip movement, which aims to recognise phrases and sentences spoken by a talking face without the audio. However, the existed models for lipreading suffer from slow training speed and insufficient performance. To accelerate the training s...

Full description

Bibliographic Details
Main Authors: Lun He, Biyun Ding, Hao Wang, Tao Zhang
Format: Article
Language:English
Published: Wiley 2022-01-01
Series:IET Image Processing
Subjects:
Online Access:https://doi.org/10.1049/ipr2.12337