A review of audio-visual speech recognition

Speech is the most important tool of interaction among human beings. This has inspired researchers to study further on speech recognition and develop a computer system that is able to integrate and understand human speech. But acoustic noisy environment can highly contaminate audio speech and affect...

Full description

Bibliographic Details
Main Authors:	Thum, Wei Seong, M. Z., Ibrahim
Format:	Article
Language:	English
Published:	UTeM 2018
Subjects:	TK Electrical engineering. Electronics Nuclear engineering
Online Access:	http://umpir.ump.edu.my/id/eprint/21637/1/A%20review%20of%20audio-visual%20speech%20recognition.pdf

_version_	1825812159669469184
author	Thum, Wei Seong M. Z., Ibrahim
author_facet	Thum, Wei Seong M. Z., Ibrahim
author_sort	Thum, Wei Seong
collection	UMP
description	Speech is the most important tool of interaction among human beings. This has inspired researchers to study further on speech recognition and develop a computer system that is able to integrate and understand human speech. But acoustic noisy environment can highly contaminate audio speech and affect the overall recognition performance. Thus, Audio-Visual Speech Recognition (AVSR) is designed to overcome the problems by utilising visual images which are unaffected by noise. The aim of this paper is to discuss the AVSR structures, which includes the front end processes, audio-visual data corpus used, recent works and accuracy estimation methods.
first_indexed	2024-03-06T12:25:01Z
format	Article
id	UMPir21637
institution	Universiti Malaysia Pahang
language	English
last_indexed	2024-03-06T12:25:01Z
publishDate	2018
publisher	UTeM
record_format	dspace
spelling	UMPir216372018-09-14T07:20:30Z http://umpir.ump.edu.my/id/eprint/21637/ A review of audio-visual speech recognition Thum, Wei Seong M. Z., Ibrahim TK Electrical engineering. Electronics Nuclear engineering Speech is the most important tool of interaction among human beings. This has inspired researchers to study further on speech recognition and develop a computer system that is able to integrate and understand human speech. But acoustic noisy environment can highly contaminate audio speech and affect the overall recognition performance. Thus, Audio-Visual Speech Recognition (AVSR) is designed to overcome the problems by utilising visual images which are unaffected by noise. The aim of this paper is to discuss the AVSR structures, which includes the front end processes, audio-visual data corpus used, recent works and accuracy estimation methods. UTeM 2018 Article PeerReviewed pdf en cc_by http://umpir.ump.edu.my/id/eprint/21637/1/A%20review%20of%20audio-visual%20speech%20recognition.pdf Thum, Wei Seong and M. Z., Ibrahim (2018) A review of audio-visual speech recognition. Journal of Telecommunication, Electronic and Computer Engineering, 10 (1-4). pp. 35-40. ISSN 2289-8131. (Published) http://journal.utem.edu.my/index.php/jtec/article/view/3573
spellingShingle	TK Electrical engineering. Electronics Nuclear engineering Thum, Wei Seong M. Z., Ibrahim A review of audio-visual speech recognition
title	A review of audio-visual speech recognition
title_full	A review of audio-visual speech recognition
title_fullStr	A review of audio-visual speech recognition
title_full_unstemmed	A review of audio-visual speech recognition
title_short	A review of audio-visual speech recognition
title_sort	review of audio visual speech recognition
topic	TK Electrical engineering. Electronics Nuclear engineering
url	http://umpir.ump.edu.my/id/eprint/21637/1/A%20review%20of%20audio-visual%20speech%20recognition.pdf
work_keys_str_mv	AT thumweiseong areviewofaudiovisualspeechrecognition AT mzibrahim areviewofaudiovisualspeechrecognition AT thumweiseong reviewofaudiovisualspeechrecognition AT mzibrahim reviewofaudiovisualspeechrecognition

A review of audio-visual speech recognition

Similar Items