Statistical modeling for speech recognition

The demand of intelligent machines that may recognize the spoken speech and respond in a natural voice has been driving speech research. The challenging in speech recognition systems due to the language nature where there are no clear boundaries between words, the phonetic beginning and ending are...

全面介绍

书目详细资料
Main Authors:	Khalifa, Othman Omran, El-Darymli, Khalid Khalil, Hassan Abdalla Hashim, Aisha, Daoud, Jamal Ibrahim
格式:	文件
语言:	English
出版:	IDOSI Publication 2013
主题:	T10.5 Communication of technical information
在线阅读:	http://irep.iium.edu.my/29656/1/21.pdf

_version_	1825647238499532800
author	Khalifa, Othman Omran El-Darymli, Khalid Khalil Hassan Abdalla Hashim, Aisha Daoud, Jamal Ibrahim
author_facet	Khalifa, Othman Omran El-Darymli, Khalid Khalil Hassan Abdalla Hashim, Aisha Daoud, Jamal Ibrahim
author_sort	Khalifa, Othman Omran
collection	IIUM
description	The demand of intelligent machines that may recognize the spoken speech and respond in a natural voice has been driving speech research. The challenging in speech recognition systems due to the language nature where there are no clear boundaries between words, the phonetic beginning and ending are influenced by neighbouring words, in addition to the variability in different speakers speech: male or female, young or senior, loud or low speech, read or spontaneous, emotional or formal, fast or slow speaking rate and the speech signal can be affected with environment noise. To avoid these difficulties the data driven statistical approach based on large quantities of spoken data is used. The performance of speech recognition systems is still far worse than that of humans. This is partly caused by the use of poor statistical models. In this paper, a comprehensive study of statistical methods for speech and language processing are presented. The role of signal processing in creating a reliable feature set for the recognizer and the role of statistical methods in enabling the recognizer to recognize the words of the spoken input sentence as well as the meaning associated with the recognized word sequence were presented.
first_indexed	2024-03-05T23:14:41Z
format	Article
id	oai:generic.eprints.org:29656
institution	International Islamic University Malaysia
language	English
last_indexed	2024-03-05T23:14:41Z
publishDate	2013
publisher	IDOSI Publication
record_format	dspace
spelling	oai:generic.eprints.org:296562020-10-26T00:45:11Z http://irep.iium.edu.my/29656/ Statistical modeling for speech recognition Khalifa, Othman Omran El-Darymli, Khalid Khalil Hassan Abdalla Hashim, Aisha Daoud, Jamal Ibrahim T10.5 Communication of technical information The demand of intelligent machines that may recognize the spoken speech and respond in a natural voice has been driving speech research. The challenging in speech recognition systems due to the language nature where there are no clear boundaries between words, the phonetic beginning and ending are influenced by neighbouring words, in addition to the variability in different speakers speech: male or female, young or senior, loud or low speech, read or spontaneous, emotional or formal, fast or slow speaking rate and the speech signal can be affected with environment noise. To avoid these difficulties the data driven statistical approach based on large quantities of spoken data is used. The performance of speech recognition systems is still far worse than that of humans. This is partly caused by the use of poor statistical models. In this paper, a comprehensive study of statistical methods for speech and language processing are presented. The role of signal processing in creating a reliable feature set for the recognizer and the role of statistical methods in enabling the recognizer to recognize the words of the spoken input sentence as well as the meaning associated with the recognized word sequence were presented. IDOSI Publication 2013-03 Article PeerReviewed application/pdf en http://irep.iium.edu.my/29656/1/21.pdf Khalifa, Othman Omran and El-Darymli, Khalid Khalil and Hassan Abdalla Hashim, Aisha and Daoud, Jamal Ibrahim (2013) Statistical modeling for speech recognition. World Applied Sciences Journal (21). pp. 115-122. ISSN 1818-4952 http://www.idosi.org/wasj/WASJ21(mae)13/21.pdf 10.5829/idosi.wasj.2013.21.mae.99935
spellingShingle	T10.5 Communication of technical information Khalifa, Othman Omran El-Darymli, Khalid Khalil Hassan Abdalla Hashim, Aisha Daoud, Jamal Ibrahim Statistical modeling for speech recognition
title	Statistical modeling for speech recognition
title_full	Statistical modeling for speech recognition
title_fullStr	Statistical modeling for speech recognition
title_full_unstemmed	Statistical modeling for speech recognition
title_short	Statistical modeling for speech recognition
title_sort	statistical modeling for speech recognition
topic	T10.5 Communication of technical information
url	http://irep.iium.edu.my/29656/1/21.pdf
work_keys_str_mv	AT khalifaothmanomran statisticalmodelingforspeechrecognition AT eldarymlikhalidkhalil statisticalmodelingforspeechrecognition AT hassanabdallahashimaisha statisticalmodelingforspeechrecognition AT daoudjamalibrahim statisticalmodelingforspeechrecognition

Statistical modeling for speech recognition

相似书籍