Finding Sparse Subnetworks in Self-Supervised Speech Recognition and Speech Synthesis

The modern paradigm in speech processing has demonstrated the importance of scale and compute for end-to-end speech recognition and synthesis. For instance, state-of-the-art self-supervised speech representation learning models typically consists of more than 300M model parameters and being trained...

Full description

Bibliographic Details
Main Author: Lai, Cheng-I Jeff
Other Authors: Glass, James R.
Format: Thesis
Published: Massachusetts Institute of Technology 2022
Online Access:https://hdl.handle.net/1721.1/144615