PartitionTuner: An operator scheduler for deep-learning compilers supporting multiple heterogeneous processing units
Embedded systems such as mobile platforms now commonly include multiple processing units that can operate in parallel, such as central processing units (CPUs) and neural processing units (NPUs). We can use deep-learning compilers to generate machine code optimized for these embedded systems from a...
Main Authors: | Misun Yu, Yongin Kwon, Jemin Lee, Jeman Park, Junmo Park, Taeho Kim |
---|---|
Format: | Article |
Language: | English |
Published: | Electronics and Telecommunications Research Institute (ETRI), 2023-04-01 |
Series: | ETRI Journal |
Subjects: | |
Online Access: | https://doi.org/10.4218/etrij.2021-0446 |
Similar Items
- TAFFO: The compiler-based precision tuner
  by: Daniele Cattaneo, et al.
  Published: (2022-12-01)
- Tensor Instruction Generation Optimization Fusing with Loop Partitioning
  by: LIANG Jiali, HUA Baojian, SU Shaobo
  Published: (2023-02-01)
- High performance compilers for parallel computing /
  by: 372416 Wolfe, Michael
  Published: (1996)
- MIXL compiler : lexical analysis and partitioning of static iterations /
  by: Nazib Nordin, author
  Published: (1982)
- A Parallelizing Compiler Based on Partial Evaluation
  by: Surati, Rajeev
  Published: (2004)