Off-Chip Memory Allocation for Neural Processing Units

Many modern Systems-on-Chip (SoCs) are equipped with specialized Machine Learning (ML) accelerators that use both on-chip and off-chip memory to execute neural networks. While on-chip memory usually has a hard limit, off-chip memory is often considered large enough to hold the network’s i...

Bibliographic Details
Main Authors:	Andrey Kvochko, Evgenii Maltsev, Artem Balyshev, Stanislav Malakhov, Alexander Efimov
Format:	Article
Language:	English
Published:	IEEE 2024-01-01
Series:	IEEE Access
Subjects:	NPU; memory allocation; neural network runtime; tiling; strip-packing problem
Online Access:	https://ieeexplore.ieee.org/document/10388314/
