AMED: Automatic Mixed-Precision Quantization for Edge Devices

Quantized neural networks are well known for reducing the latency, power consumption, and model size without significant harm to the performance. This makes them highly appropriate for systems with limited resources and low power capacity. Mixed-precision quantization offers better utilization of cu...

Full description

Bibliographic Details
Main Authors: Moshe Kimhi, Tal Rozen, Avi Mendelson, Chaim Baskin
Format: Article
Language:English
Published: MDPI AG 2024-06-01
Series:Mathematics
Subjects:
Online Access:https://www.mdpi.com/2227-7390/12/12/1810