Audiovisual speech recognition based on a deep convolutional neural network
Audiovisual speech recognition is an emerging research topic. Lipreading is the recognition of what someone is saying using visual information, primarily lip movements. In this study, we created a custom dataset for Indian English linguistics and categorized it into three main categories: (1) audio...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: | KeAi Communications Co. Ltd., 2024-03-01 |
Series: | Data Science and Management |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2666764923000450 |