Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization
We consider the problem of accurate quantization for language models, where both the weights and activations are quantized to 4 bits per parameter with uniform quantization, the lowest bitwidth format natively supported by existing GPU hardware. In this context, the key challenge is activation quant...
Main Author: | Nrusimha, Aniruddha |
---|---|
Other Authors: | Kim, Yoon |
Format: | Thesis |
Published: |
Massachusetts Institute of Technology
2024
|
Online Access: | https://hdl.handle.net/1721.1/156280 |
Similar Items
-
Avoided level crossings in the quantization of a mixed regular-chaotic system.
by: Mainiero, T, et al.
Published: (2007) -
Beyond market and politics--changing regularization policies towards unauthorized colonies in Dehli
by: Dasgupta, Aniruddha, 1964-
Published: (2011) -
Quantization Games on Social Networks and Language Evolution
by: Mani, Ankur, et al.
Published: (2021) -
Quantization Games on Social Networks and Language Evolution
by: Mani, Ankur, et al.
Published: (2022) -
Mitigating quantization effects on distributed sensor fusion : a least squares approach
by: Zhu, Shanying, et al.
Published: (2020)