Off-Chip Memory Allocation for Neural Processing Units

Many modern Systems-on-Chip (SoCs) are equipped with specialized Machine Learning (ML) accelerators that use both on-chip and off-chip memory to execute neural networks. While on-chip memory usually has a hard limit, off-chip memory is often considered large enough to hold the network’s i...

Full description

Bibliographic Details
Main Authors: Andrey Kvochko, Evgenii Maltsev, Artem Balyshev, Stanislav Malakhov, Alexander Efimov
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10388314/