Recognizing Speech with Large Language Models

Recognizing Speech with Large Language Models

Recent work has shown that large language models can be made to parse the contents of non-text embeddings and use those contents to perform various tasks. However, work focusing on audio inputs to large language models has thus far focused on either training a joint audio-text model from scratch on...

Full description

Bibliographic Details
Main Author:	Zeitoun, Abbas
Other Authors:	Kim, Yoon
Format:	Thesis
Published:	Massachusetts Institute of Technology 2023
Online Access:	https://hdl.handle.net/1721.1/151573

Similar Items

A virtual vocabulary speech recognizer
by: Pathe, Peter D
Published: (2013)

Recognizing intonational patterns in English speech
by: Panttaja, Erin Marie, 1975-
Published: (2009)

Performance of adapting non-native speech in isolated speech recognizer
by: N., Seman, et al.
Published: (2008)

Discriminative training of acoustic models in a segment-based speech recognizer
by: Sandness, Eric D. (Eric David), 1979-
Published: (2014)

Punctuation restoration for speech transcripts using large language models
by: Liu, Changsong
Published: (2024)

Infrastructure development for integration of lip reading into the SUMMIT Speech Recognizer
by: La, Chia-Hao, 1980-
Published: (2006)

Incorporating a feature tree geometry into a matcher for a speech recognizer
by: Maldonado, Aaron (Aaron Theodore), 1976-
Published: (2005)

A 6 mW, 5,000-Word Real-Time Speech Recognizer Using WFST Models
by: Chandrakasan, Anantha P., et al.
Published: (2016)

Recognizing text on maps
by: Goel, Tejas
Published: (2023)

Recognizing texts on maps
by: Tan, Pheng Khai
Published: (2024)

Recognizing indoor scenes
by: Quattoni, Ariadna, et al.
Published: (2010)

Recognizing Indoor Scenes
by: Torralba, Antonio, et al.
Published: (2004)

Recognizing Facial Slivers
by: Gilad-Gutnick, Sharon, et al.
Published: (2020)

Cross-lingual phone mapping for large vocabulary speech recognition of under-resourced languages
by: Do, Van Hai, et al.
Published: (2014)

Recognizing unknown objects with attributes relationship model
by: Hoo, Wai Lam, et al.
Published: (2015)

Automatic Acquisition of Language Models for Speech Recognition
by: McCandless, Michael Kyle
Published: (2023)

Automatic acquisition of language models for speech recognition
by: McCandless, Michael Kyle
Published: (2007)

Truthfulness in Large Language Models
by: Liu, Kevin
Published: (2023)

Recognizing Three-Dimensional Objects without the Use of Models
by: Marill, Thomas
Published: (2004)

Recognizing Interspersed sketches quickly
by: Hammond, Tracy A., et al.
Published: (2012)

Bayesian neural network language modeling for speech recognition
by: Xue, Boyang, et al.
Published: (2023)

Large-margin Gaussian mixture modeling for automatic speech recognition
by: Chang, Hung-An, Ph. D. Massachusetts Institute of Technology
Published: (2009)

Inference acceleration of large language models
by: Zhang, Boyu
Published: (2024)

A speech recognition module for speech-to-text language translation
by: Mwanyoha, Sadiki Pili, 1974-
Published: (2005)

Language generation and speech synthesis in dialogues for language learning
by: Zhang, Julia, 1981-
Published: (2005)

Language model domain adaptation for automatic speech recognition systems
by: Khassanov, Yerbolat
Published: (2020)

Another way to recognize human action
by: Abdullah, Lili Nurliyana, et al.
Published: (2007)

Detecting and recognizing human action in videos
by: Yu, Gang
Published: (2014)

Recognizing sound-event by machine learning
by: Athaariq Ramadino
Published: (2020)

Perceiving and recognizing three-dimensional forms
by: Sinha, Pawan
Published: (2005)

Recognizing actions using embodiment & empathy
by: McIntyre, Robert Louis
Published: (2014)

Preprocessor for Programs which Recognize Scenes
by: Mahabala, H.N.
Published: (2004)

Dataset of Chinese language beginning learners reading speech and text-to-speech
by: Yoke Lian Lau
Published: (2023)

PDPIPS technique in Malay language speech
by: Mohd Kiram, Norazlina
Published: (2023)

Gender bias and stereotypes in Large Language Models
by: Kotek, Hadas, et al.
Published: (2023)

Revolutionising portfolio management with large language model
by: Kee, Kai Teng
Published: (2024)

Personality prediction based on large language models
by: Wee, Jewel Xin Yu
Published: (2024)

Developing locally trainable large language models
by: Chen, Hailin
Published: (2025)

Medical chatbot interface for large language models
by: Chua, Yu Hao
Published: (2024)

Advances and applications of large language models II
by: Ng, Qi Xuan
Published: (2024)