Neuron-by-Neuron Quantization for Efficient Low-Bit QNN Training

Quantized neural networks (QNNs) are widely used to achieve computationally efficient solutions to recognition problems. Overall, eight-bit QNNs have almost the same accuracy as full-precision networks, but working several times faster. However, the networks with lower quantization levels demonstrat...

Full description

Bibliographic Details
Main Authors:	Artem Sher, Anton Trusov, Elena Limonova, Dmitry Nikolaev, Vladimir V. Arlazarov
Format:	Article
Language:	English
Published:	MDPI AG 2023-04-01
Series:	Mathematics
Subjects:	quantized neural network low-bit quantization layer-by-layer neuron-by-neuron training
Online Access:	https://www.mdpi.com/2227-7390/11/9/2112

Internet

https://www.mdpi.com/2227-7390/11/9/2112

Neuron-by-Neuron Quantization for Efficient Low-Bit QNN Training

Internet

Similar Items