Lip Reading by Alternating between Spatiotemporal and Spatial Convolutions

Lip reading (LR) is the task of predicting speech using only visual information from the speaker. This work studies, for the first time, the benefits of alternating between spatiotemporal and spatial convolutions for learning effective features from LR sequences. In this context...

Bibliographic Details
Main Authors: Dimitrios Tsourounis, Dimitris Kastaniotis, Spiros Fotopoulos
Format: Article
Language: English
Published: MDPI AG 2021-05-01
Series: Journal of Imaging
Online Access: https://www.mdpi.com/2313-433X/7/5/91