Loop tiling in large-scale stencil codes at run-time with OPS
The key common bottleneck in most stencil codes is data movement, and prior research has shown that improving data locality through optimisations that schedule across loops do particularly well. However, in many large PDE applications it is not possible to apply such optimisations through compilers...
Main Authors: | Reguly, I, Mudalige, G, Giles, M |
---|---|
Format: | Journal article |
Published: |
Institute of Electrical and Electronics Engineers
2017
|
Similar Items
-
Beyond 16GB: Out-of-core stencil computations
by: Reguly, I, et al.
Published: (2017) -
OP2-TPDS2015-DATA
by: Mudalige, G, et al.
Published: (2015) -
Auto-vectorizing a large-scale production unstructured-mesh CFD application
by: Mudalige, G, et al.
Published: (2016) -
Designing OP2 for GPU architectures
by: Giles, M, et al.
Published: (2013) -
Prediction of Optimal Loop Tiling Size for stencil Computation Based on Neural Network Model
by: BAO Yi-kun, ZHANG Peng, XU Xiao-wen, MO Ze-yao
Published: (2022-10-01)