Complexity-Aware Layer-Wise Mixed-Precision Schemes With SQNR-Based Fast Analysis

Deep neural network (DNN) acceleration has recently become critical for hardware systems ranging from mobile/edge devices to high-performance data centers. For on-device AI in particular, many studies have examined reducing hardware numerical precision, given the limited hardware resources of mobile...

Bibliographic Details
Main Authors: Hana Kim, Hyun Eun, Jung Hwan Choi, Ji-Hoon Kim
Format: Article
Language: English
Published: IEEE 2023-01-01
Series: IEEE Access
Online Access: https://ieeexplore.ieee.org/document/10287357/