Automatic Partitioning of Parallel Loops for Cache-coherent Multiprocessors

This paper presents a theoretical framework for automatically partitioning parallel loops to minimize cache coherency traffic on shared-memory multiprocessors. The framework introduces the notion of uniformly intersecting references to capture temporal locality in array references, and the idea of...

Full description

Bibliographic Details
Main Authors: Agarwal, Anant, Kranz, David, Natarajan, Venkat
Published: 2023
Online Access:https://hdl.handle.net/1721.1/149205