Recognizing Speech with Large Language Models

Recent work has shown that large language models can be made to parse the contents of non-text embeddings and use those contents to perform various tasks. However, work focusing on audio inputs to large language models has thus far focused on either training a joint audio-text model from scratch on...

Full description

Bibliographic Details
Main Author:	Zeitoun, Abbas
Other Authors:	Kim, Yoon
Format:	Thesis
Published:	Massachusetts Institute of Technology 2023
Online Access:	https://hdl.handle.net/1721.1/151573

Internet

https://hdl.handle.net/1721.1/151573

Recognizing Speech with Large Language Models

Internet

Similar Items