PartitionTuner: An operator scheduler for deep-learning compilers supporting multiple heterogeneous processing units

Recently, embedded systems, such as mobile platforms, have multiple proces sing units that can operate in parallel, such as centralized processing units (CPUs) and neural processing units (NPUs). We can use deep-learning compilers to generate machine code optimized for these embedded systems from a...

Description complète

Détails bibliographiques
Auteurs principaux: Misun Yu, Yongin Kwon, Jemin Lee, Jeman Park, Junmo Park, Taeho Kim
Format: Article
Langue:English
Publié: Electronics and Telecommunications Research Institute (ETRI) 2023-04-01
Collection:ETRI Journal
Sujets:
Accès en ligne:https://doi.org/10.4218/etrij.2021-0446