Auto-vectorizing a large-scale production unstructured-mesh CFD application
For modern x86 based CPUs with increasingly longer vector lengths, achieving good vectorization has become very important for gaining higher performance. Using very explicit SIMD vector programming techniques has been shown to give near optimal performance, however they are difficult to implement fo...
Main Authors: | Mudalige, G, Reguly, I, Giles, M |
---|---|
Format: | Conference item |
Published: |
ACM
2016
|
Similar Items
-
Vectorizing unstructured mesh computations for many-core architectures
by: Giles, M, et al.
Published: (2015) -
Design and Performance of the OP2 Library for Unstructured Mesh Applications.
by: Bertolli, C, et al.
Published: (2011) -
Large-scale performance of a DSL-based multi-block structured-mesh application for direct numerical simulation
by: Mudalige, G, et al.
Published: (2019) -
Acceleration of a Full-scale Industrial CFD Application with OP2
by: Reguly, I, et al.
Published: (2015) -
Loop tiling in large-scale stencil codes at run-time with OPS
by: Reguly, I, et al.
Published: (2017)