Automatic Partitioning of Parallel Loops and Data Arrays for Distributed Shared-memory Multiprocessors

This paper presents a theoretical framework for automatically partitioning parallel loops to minimize cache coherency traffic on shared-memory multiprocessors. While several previous papers have looked at hyperplane partitioning of iteration spaces to reduce communication traffic, the problem of de...

Bibliographic Details
Main Authors: Agarwal, Anant; Kranz, David A.; Natarajan, Venkat
Published: 2023
Online Access: https://hdl.handle.net/1721.1/149251