PartitionTuner: An operator scheduler for deep-learning compilers supporting multiple heterogeneous processing units

Recently, embedded systems, such as mobile platforms, have multiple proces sing units that can operate in parallel, such as centralized processing units (CPUs) and neural processing units (NPUs). We can use deep-learning compilers to generate machine code optimized for these embedded systems from a...

Full description

Bibliographic Details
Main Authors: Misun Yu, Yongin Kwon, Jemin Lee, Jeman Park, Junmo Park, Taeho Kim
Format: Article
Language:English
Published: Electronics and Telecommunications Research Institute (ETRI) 2023-04-01
Series:ETRI Journal
Subjects:
Online Access:https://doi.org/10.4218/etrij.2021-0446