Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization
We consider the problem of accurate quantization for language models, where both the weights and activations are quantized to 4 bits per parameter with uniform quantization, the lowest bitwidth format natively supported by existing GPU hardware. In this context, the key challenge is activation quant...
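The abstract names outlier activation channels as the key obstacle to 4-bit uniform quantization. As a minimal illustration of that problem (not the thesis's method), the sketch below applies symmetric per-tensor uniform quantization to a synthetic activation vector and shows how a single outlier stretches the quantization scale and inflates the error on every other value; all names and values here are hypothetical:

```python
import random

def uniform_quantize(xs, bits=4):
    """Symmetric per-tensor uniform quantization to 2**bits integer levels."""
    qmax = 2 ** (bits - 1) - 1                 # 7 for signed 4-bit
    scale = max(abs(x) for x in xs) / qmax     # one scale shared by the whole tensor
    # round to the nearest level, clip to the representable range, dequantize
    return [max(-qmax - 1, min(qmax, round(x / scale))) * scale for x in xs]

random.seed(0)
acts = [random.gauss(0.0, 1.0) for _ in range(512)]      # well-behaved activations
err = sum(abs(q - x) for q, x in zip(uniform_quantize(acts), acts)) / len(acts)

acts_outlier = acts[:-1] + [50.0]                        # one outlier channel value
err_outlier = sum(abs(q - x) for q, x in
                  zip(uniform_quantize(acts_outlier), acts_outlier)) / len(acts_outlier)

# The outlier inflates the shared scale, so rounding error grows for all entries.
print(err, err_outlier)
```

Because the scale is shared across the tensor, one large value coarsens the grid for everything else, which is why outlier-channel mitigation (the subject of this thesis) matters for low-bitwidth activation quantization.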
| Main Author: | Nrusimha, Aniruddha |
| --- | --- |
| Other Authors: | Kim, Yoon |
| Format: | Thesis |
| Published: | Massachusetts Institute of Technology, 2024 |
| Online Access: | https://hdl.handle.net/1721.1/156280 |
Similar Items
- Soft Quantization Using Entropic Regularization
  by: Rajmadan Lakshmanan, et al.
  Published: (2023-10-01)
- O-2A: Outlier-Aware Compression for 8-bit Post-Training Quantization Model
  by: Nguyen-Dong Ho, et al.
  Published: (2023-01-01)
- Outliers in diffusion-weighted MRI: Exploring detection models and mitigation strategies
  by: Viljami Sairanen, et al.
  Published: (2023-12-01)
- Evaluation of Model Quantization Method on Vitis-AI for Mitigating Adversarial Examples
  by: Yuta Fukuda, et al.
  Published: (2023-01-01)
- Avoided level crossings in the quantization of a mixed regular-chaotic system.
  by: Mainiero, T, et al.
  Published: (2007)