HyperBlock floating point: generalised quantization scheme for gradient and inference computation

Prior quantization methods focus on producing networks for fast and lightweight inference. However, the cost of unquantised training is overlooked, despite requiring significantly more time and energy than inference. We present a method for quantizing convolutional neural networks for efficient trai...

Ful tanımlama

Detaylı Bibliyografya
Asıl Yazarlar: Gennari do Nascimento, M, Adrian Prisacariu, V, Fawcett, R, Langhammer, M
Materyal Türü: Conference item
Dil:English
Baskı/Yayın Bilgisi: IEEE 2023