Spontaneous speech recognition using HMMs

Thesis (M.Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, February 2003.

Bibliographic Details
Main Author:	Yoder, Benjamin W. (Benjamin Wesley), 1977-
Other Authors:	Deb Roy.
Format:	Thesis
Language:	eng
Published:	Massachusetts Institute of Technology 2007
Subjects:	Electrical Engineering and Computer Science.
Online Access:	http://hdl.handle.net/1721.1/36108

_version_	1826211381378023424
author	Yoder, Benjamin W. (Benjamin Wesley), 1977-
author2	Deb Roy.
author_facet	Deb Roy. Yoder, Benjamin W. (Benjamin Wesley), 1977-
author_sort	Yoder, Benjamin W. (Benjamin Wesley), 1977-
collection	MIT
description	Thesis (M.Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, February 2003.
first_indexed	2024-09-23T15:05:00Z
format	Thesis
id	mit-1721.1/36108
institution	Massachusetts Institute of Technology
language	eng
last_indexed	2024-09-23T15:05:00Z
publishDate	2007
publisher	Massachusetts Institute of Technology
record_format	dspace
spelling	mit-1721.1/361082019-04-11T07:04:50Z Spontaneous speech recognition using HMMs Yoder, Benjamin W. (Benjamin Wesley), 1977- Deb Roy. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (M.Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, February 2003. Includes bibliographical references (leaf 63). This thesis describes a speech recognition system that was built to support spontaneous speech understanding. The system is composed of (1) a front end acoustic analyzer which computes Mel-frequency cepstral coefficients, (2) acoustic models of context-dependent phonemes (triphones), (3) a back-off bigram statistical language model, and (4) a beam search decoder based on the Viterbi algorithm. The contextdependent acoustic models resulted in 67.9% phoneme recognition accuracy on the standard TIMIT speech database. Spontaneous speech was collected using a "Wizard of Oz" simulation of a simple spatial manipulation game. Naive subjects were instructed to manipulate blocks on a computer screen in order to solve a series of geometric puzzles using only spoken commands. A hidden human operator performed actions in response to each spoken command. The speech from thirteen subjects formed the corpus for the speech recognition results reported here. Using a task-specific bigram statistical language model and context-dependent acoustic models, the system achieved a word recognition accuracy of 67.6%. The recognizer operated using a vocabulary of 523 words. The recognition had a word perplexity of 36. by Benjamin W. Yoder. M.Eng. 2007-02-21T11:25:34Z 2007-02-21T11:25:34Z 2001 2003 Thesis http://hdl.handle.net/1721.1/36108 52206486 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 63 leaves application/pdf Massachusetts Institute of Technology
spellingShingle	Electrical Engineering and Computer Science. Yoder, Benjamin W. (Benjamin Wesley), 1977- Spontaneous speech recognition using HMMs
title	Spontaneous speech recognition using HMMs
title_full	Spontaneous speech recognition using HMMs
title_fullStr	Spontaneous speech recognition using HMMs
title_full_unstemmed	Spontaneous speech recognition using HMMs
title_short	Spontaneous speech recognition using HMMs
title_sort	spontaneous speech recognition using hmms
topic	Electrical Engineering and Computer Science.
url	http://hdl.handle.net/1721.1/36108
work_keys_str_mv	AT yoderbenjaminwbenjaminwesley1977 spontaneousspeechrecognitionusinghmms

Spontaneous speech recognition using HMMs

Similar Items