Using graphone models in automatic speech recognition

Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2009.

Bibliographic Details
Main Author:	Wang, Stanley Xinlei
Other Authors:	James R. Glass and I. Lee Hetherington.
Format:	Thesis
Language:	eng
Published:	Massachusetts Institute of Technology 2010
Subjects:	Electrical Engineering and Computer Science.
Online Access:	http://hdl.handle.net/1721.1/53114

_version_	1826214409415950336
author	Wang, Stanley Xinlei
author2	James R. Glass and I. Lee Hetherington.
author_facet	James R. Glass and I. Lee Hetherington. Wang, Stanley Xinlei
author_sort	Wang, Stanley Xinlei
collection	MIT
description	Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2009.
first_indexed	2024-09-23T16:04:59Z
format	Thesis
id	mit-1721.1/53114
institution	Massachusetts Institute of Technology
language	eng
last_indexed	2024-09-23T16:04:59Z
publishDate	2010
publisher	Massachusetts Institute of Technology
record_format	dspace
spelling	mit-1721.1/531142019-04-12T13:44:09Z Using graphone models in automatic speech recognition Wang, Stanley Xinlei James R. Glass and I. Lee Hetherington. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2009. Includes bibliographical references (p. 87-90). This research explores applications of joint letter-phoneme subwords, known as graphones, in several domains to enable detection and recognition of previously unknown words. For these experiments, graphones models are integrated into the SUMMIT speech recognition framework. First, graphones are applied to automatically generate pronunciations of restaurant names for a speech recognizer. Word recognition evaluations show that graphones are effective for generating pronunciations for these words. Next, a graphone hybrid recognizer is built and tested for searching song lyrics by voice, as well as transcribing spoken lectures in a open vocabulary scenario. These experiments demonstrate significant improvement over traditional word-only speech recognizers. Modifications to the flat hybrid model such as reducing the graphone set size are also considered. Finally, a hierarchical hybrid model is built and compared with the flat hybrid model on the lecture transcription task. by Stanley Xinlei Wang. M.Eng. 2010-03-25T15:02:56Z 2010-03-25T15:02:56Z 2009 2009 Thesis http://hdl.handle.net/1721.1/53114 503114771 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 90 p. application/pdf Massachusetts Institute of Technology
spellingShingle	Electrical Engineering and Computer Science. Wang, Stanley Xinlei Using graphone models in automatic speech recognition
title	Using graphone models in automatic speech recognition
title_full	Using graphone models in automatic speech recognition
title_fullStr	Using graphone models in automatic speech recognition
title_full_unstemmed	Using graphone models in automatic speech recognition
title_short	Using graphone models in automatic speech recognition
title_sort	using graphone models in automatic speech recognition
topic	Electrical Engineering and Computer Science.
url	http://hdl.handle.net/1721.1/53114
work_keys_str_mv	AT wangstanleyxinlei usinggraphonemodelsinautomaticspeechrecognition

Using graphone models in automatic speech recognition

Similar Items