Hardware-Centric AutoML for Mixed-Precision Quantization
Abstract Model quantization is a widely used technique to compress and accelerate deep neural network (DNN) inference. Emergent DNN hardware accelerators begin to support flexible bitwidth (1–8 bits) to further improve the computation efficiency, which raises a great challenge to find the optimal b...
Main Authors: | Wang, Kuan, Liu, Zhijian, Lin, Yujun, Lin, Ji, Han, Song |
---|---|
Other Authors: | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science |
Format: | Article |
Language: | English |
Published: |
Springer US
2021
|
Online Access: | https://hdl.handle.net/1721.1/131513 |
Similar Items
-
HAQ: Hardware-Aware Automated Quantization With Mixed Precision
by: Wang, Kuan, et al.
Published: (2021) -
AMC: AutoML for Model Compression and Acceleration on Mobile Devices
by: Lin, Ji, et al.
Published: (2021) -
AutoGenome: An AutoML tool for genomic research
by: Denghui Liu, et al.
Published: (2021-12-01) -
Assisting Educational Analytics with AutoML Functionalities
by: Spyridon Garmpis, et al.
Published: (2022-06-01) -
Multimodal AutoML via Representation Evolution
by: Blaž Škrlj, et al.
Published: (2022-12-01)