Demand MemCpy: Overlapping of Computation and Data Transfer for Heterogeneous Computing

Heterogeneous computing relies on collaboration among different types of processors on shared data. In systems with discrete accelerators (e.g., GP-GPU), data sharing requires transferring a large amount of data between CPU and accelerator memories and can significantly increase the end-to-end execu...

Full description

Bibliographic Details
Main Authors: Donghun Jeong, Jihun Park, Jungrae Kim
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9845392/