Audiovisual speech recognition based on a deep convolutional neural network

Audiovisual speech recognition is an emerging research topic. Lipreading is the recognition of what someone is saying using visual information, primarily lip movements. In this study, we created a custom dataset for Indian English linguistics and categorized it into three main categories: (1) audio...

Полное описание

Библиографические подробности
Главные авторы: Shashidhar Rudregowda, Sudarshan Patilkulkarni, Vinayakumar Ravi, Gururaj H.L., Moez Krichen
Формат: Статья
Язык:English
Опубликовано: KeAi Communications Co. Ltd. 2024-03-01
Серии:Data Science and Management
Предметы:
Online-ссылка:http://www.sciencedirect.com/science/article/pii/S2666764923000450