CANET: Quantized Neural Network Inference With 8-bit Carry-Aware Accumulator

Neural network quantization represents weights and activations with few bits, greatly reducing the overhead of multiplications. However, due to the recursive accumulation operations, high-precision accumulators are still required in multiply-accumulate (MAC) units to avoid overflow, incurring signif...

Full description

Bibliographic Details
Main Authors: Jingxuan Yang, Xiaoqin Wang, Yiying Jiang
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10445180/