Recognizing Speech with Large Language Models
Recent work has shown that large language models can be made to parse the contents of non-text embeddings and use those contents to perform various tasks. However, work on audio inputs to large language models has thus far focused on either training a joint audio-text model from scratch on...
Main Author: | Zeitoun, Abbas |
---|---|
Other Authors: | Kim, Yoon |
Format: | Thesis |
Published: | Massachusetts Institute of Technology, 2023 |
Online Access: | https://hdl.handle.net/1721.1/151573 |
Similar Items
- A virtual vocabulary speech recognizer
  by: Pathe, Peter D
  Published: (2013)
- Recognizing intonational patterns in English speech
  by: Panttaja, Erin Marie, 1975-
  Published: (2009)
- Performance of adapting non-native speech in isolated speech recognizer
  by: N., Seman, et al.
  Published: (2008)
- Discriminative training of acoustic models in a segment-based speech recognizer
  by: Sandness, Eric D. (Eric David), 1979-
  Published: (2014)
- Punctuation restoration for speech transcripts using large language models
  by: Liu, Changsong
  Published: (2024)