Integer-Only CNNs with 4 Bit Weights and Bit-Shift Quantization Scales at Full-Precision Accuracy
Quantization of neural networks is one of the most popular techniques for compressing models to fit embedded (IoT) hardware platforms with highly constrained latency, storage, memory-bandwidth, and energy budgets. Limiting the number of bits per weight and activation has been the main focus in...
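The "bit-shift quantization scales" in the title refer to restricting quantization scale factors to powers of two, so that rescaling on integer hardware reduces to a bit-shift instead of a multiplication. A minimal NumPy sketch of symmetric 4-bit weight quantization with a power-of-two scale, assuming nothing about the paper's actual training procedure (the function name and rounding choices are illustrative):

```python
import numpy as np

def quantize_po2(w, num_bits=4):
    """Symmetric quantization with a power-of-two scale 2**k.

    Because scale = 2**k, dequantization (q * 2**k) is a plain
    bit-shift on integer hardware rather than a multiplication.
    """
    qmax = 2 ** (num_bits - 1) - 1              # e.g. 7 for signed 4-bit
    max_abs = np.max(np.abs(w))
    k = int(np.ceil(np.log2(max_abs / qmax)))   # smallest exponent covering the range
    scale = 2.0 ** k
    q = np.clip(np.round(w / scale), -qmax - 1, qmax).astype(np.int8)
    return q, k

# Toy weight tensor; real CNN weights would be quantized per layer or channel.
w = np.array([0.31, -0.52, 0.07, -0.11], dtype=np.float32)
q, k = quantize_po2(w)
deq = q.astype(np.float32) * 2.0 ** k           # dequantize: shift by k bits
```

Here `k = -3`, so dequantization is a right-shift by 3 on fixed-point hardware; the integer codes stay within the signed 4-bit range [-8, 7].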
Main Authors: Maarten Vandersteegen, Kristof Van Beeck, Toon Goedemé
Format: Article
Language: English
Published: MDPI AG, 2021-11-01
Series: Electronics
Online Access: https://www.mdpi.com/2079-9292/10/22/2823
Similar Items
- A Hardware-Friendly Low-Bit Power-of-Two Quantization Method for CNNs and Its FPGA Implementation
  by: Xuefu Sui, et al. Published: (2022-09-01)
- Latitude-Adaptive Integer Bit Allocation for Quantization of Omnidirectional Images
  by: Qian Sima, et al. Published: (2024-02-01)
- Training Multi-Bit Quantized and Binarized Networks with a Learnable Symmetric Quantizer
  by: Phuoc Pham, et al. Published: (2021-01-01)
- Entropy-Constrained Scalar Quantization with a Lossy-Compressed Bit
  by: Melanie F. Pradier, et al. Published: (2016-12-01)
- Neuron-by-Neuron Quantization for Efficient Low-Bit QNN Training
  by: Artem Sher, et al. Published: (2023-04-01)