AE-Qdrop: Towards Accurate and Efficient Low-Bit Post-Training Quantization for A Convolutional Neural Network
Blockwise reconstruction with adaptive rounding helps achieve acceptable 4-bit post-training quantization accuracy. However, adaptive rounding is time-intensive, and the optimization space of weight elements is constrained to a binary set, which limits the performance of quantized models. The optim...
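The abstract notes that adaptive rounding restricts each weight to a binary choice: round down or round up relative to the quantization grid. A minimal sketch of that idea (AdaRound-style, not the paper's AE-Qdrop method; the function names, the 4-bit signed range, and the nearest-rounding initialization are illustrative assumptions):

```python
import numpy as np

def quantize_adaptive(w, scale, round_up):
    """Quantize weights with a per-element binary rounding decision.

    round_up is a boolean array: False -> floor, True -> ceil.
    In adaptive-rounding methods this decision is *learned* via
    blockwise reconstruction; here it is just supplied directly.
    """
    w_floor = np.floor(w / scale)
    w_int = w_floor + round_up.astype(w_floor.dtype)  # binary {0, 1} offset
    # Clip to a signed 4-bit integer range [-8, 7] (assumed bit-width)
    return np.clip(w_int, -8, 7) * scale

w = np.array([0.31, -0.42, 0.17])
scale = 0.1
# Nearest-rounding used as the initialization of the binary decisions
frac = w / scale - np.floor(w / scale)
round_up = frac >= 0.5
wq = quantize_adaptive(w, scale, round_up)
```

The point of the binary constraint is that the optimizer may only flip each weight between its two nearest grid points; it cannot move a weight to an arbitrary integer level, which is the limitation the abstract refers to.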
Main Authors: Jixing Li, Gang Chen, Min Jin, Wenyu Mao, Huaxiang Lu
Format: Article
Language: English
Published: MDPI AG, 2024-02-01
Series: Electronics
Online Access: https://www.mdpi.com/2079-9292/13/3/644
Similar Items

- Clipping-Based Post Training 8-Bit Quantization of Convolution Neural Networks for Object Detection
  by: Leisheng Chen, et al. Published: (2022-12-01)
- Training Multi-Bit Quantized and Binarized Networks with a Learnable Symmetric Quantizer
  by: Phuoc Pham, et al. Published: (2021-01-01)
- Super-Resolution Model Quantized in Multi-Precision
  by: Jingyu Liu, et al. Published: (2021-09-01)
- Two Novel Non-Uniform Quantizers with Application in Post-Training Quantization
  by: Zoran Perić, et al. Published: (2022-09-01)
- Partial Encryption of Compressed Image Using Threshold Quantization and AES Cipher
  by: H. A. Younis, et al. Published: (2012-01-01)