Performance Evaluation of INT8 Quantized Inference on Mobile GPUs

During the past several years, the need for on-device deep learning has rapidly increased, and the performance of mobile GPUs has significantly increased. As a viable approach for efficient on-device deep learning, INT8 quantized inference has been actively studied and proposed but there are current...

Full description

Bibliographic Details
Main Authors: Sumin Kim, Gunju Park, Youngmin Yi
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9638444/