-
1
Evaluating Kernels on Xeon Phi to accelerate Gysela application
Published 2016-03-01“…This work describes the challenges presented by porting parts of the Gysela code to the Intel Xeon Phi coprocessor, as well as techniques used for optimization, vectorization and tuning that can be applied to other applications. …”
Get full text
Article -
2
Improving main memory hash joins on Intel Xeon Phi processors
Published 2018Subjects: Get full text
Get full text
Journal Article -
3
MPI communication benchmarking on Intel Xeon dual quad-core processor cluster
Published 2011“…This paper reports the measurements of MPI communication benchmarking on Khaldun cluster which ran on Linux-based IBM Blade HS21 Servers with Intel Xeon dual quad-core processor and Gigabit Ethernet interconnect. …”
Get full text
Conference or Workshop Item -
4
Evaluating New Architectural Features Of The Intel(R) Xeon(R) 7500 Processor For Hpc Workloads
Published 2011-01-01“…We compare two families of Intel Xeonbased systems (Intel Xeon 7500 and Intel Xeon 5600) and present a performance evolutionof 16 node clusters based on these CPUs. …”
Get full text
Article -
5
Evaluation of the Intel Xeon Phi 7120 and NVIDIA K80 as accelerators for two-dimensional panel codes.
Published 2017-01-01“…In this paper we evaluate the performance of such an optimization algorithm on modern accelerators (more specifically, the Intel Xeon Phi 7120 and the NVIDIA K80). For that purpose, we have implemented an optimized version of the algorithm on the CPU and Xeon Phi (based on OpenMP, vectorization, and the Intel MKL library) and on the GPU (based on CUDA and the MAGMA library). …”
Get full text
Article -
6
Correction: Memory hierarchy characterization of SPEC CPU2006 and SPEC CPU2017 on the Intel Xeon Skylake-SP
Published 2024-01-01Get full text
Article -
7
-
8
Many-core needs fine-grained scheduling : a case study of query processing on Intel Xeon Phi processors
Published 2020“…Emerging many-core processors feature very high memory bandwidth and computational power. For example, Intel Xeon Phi many-core processors of the Knights Corner (KNC) and Knights Landing (KNL) architectures embrace 60 to 64 x86-based CPU cores with 512-bit SIMD capabilities and high-bandwidth memories like the GDDR5 on KNC and on-package DRAMs on KNL. …”
Get full text
Journal Article -
9
Performance analysis of Message Passing Interface collective communication on intel xeon quad-core gigabit ethernet and infiniband clusters
Published 2013“…Consequently, this study concentrates on benchmarking MPI implementation on multi-core architecture by measuring the performance of Open MPI collective communication on Intel Xeon dual quad-core Gigabit Ethernet and InfiniBand clusters using SKaMPI. …”
Get full text
Article -
10
Performance Evaluation of Massively Parallel Systems Using SPEC OMP Suite
Published 2022-05-01Subjects: Get full text
Article -
11
A Comparative Study of Methods for Measurement of Energy of Computing
Published 2019-06-01Subjects: Get full text
Article -
12
Plasma Physics
Published 2010Subjects: “…Anomalous Pulsed Emission and Absorption by a Xeon Plasma at the Electron-Cyclotron Frequency…”
Get full text
Technical Report -
13
Software prefetching for unstructured mesh applications
Published 2020“…Through this, we show significant full application speed-ups on a range of processors and realistic test cases in both single core/tile and full socket configurations, such as 1.14× on the Intel Xeon Sandy Bridge, 1.09× on the Intel Xeon Broadwell, 1.29× on the Intel Xeon Skylake, 1.99× on the in-order Intel Xeon Phi Knights Corner coprocessor, and 1.51× on the out-of-order Intel Xeon Phi Knights Landing many-core processor.…”
Journal article -
14
Vectorizing unstructured mesh computations for many-core architectures
Published 2015“…This paper presents results on achieving high performance through vectorization on CPUs and the Xeon-Phi on a key class of irregular applications: unstructured mesh computations. …”
Journal article -
15
Some useful optimisations for unstructured computational fluid dynamics codes on multicore and manycore architectures
Published 2018“…This paper presents a number of optimisations for improving the performance of unstructured computational fluid dynamics codes on multicore and manycore architectures such as the Intel Sandy Bridge, Broadwell and Skylake CPUs and the Intel Xeon Phi Knights Corner and Knights Landing manycore processors. …”
Journal article -
16
The VOLNA-OP2 tsunami code (version 1.5)
Published 2018“…The scalability of the code is demonstrated on three supercomputers, built with classical Xeon CPUs, the Intel Xeon Phi, and NVIDIA P100 GPUs. …”
Journal article -
17
A polyphase filter for many-core architectures
Published 2016“…We describe in detail our implementation of the polyphase filter algorithm and its behaviour on three generations of NVIDIA GPU cards (Fermi, Kepler, Maxwell), on the Intel Xeon CPU and Xeon Phi (Knights Corner) platforms. …”
Journal article -
18
Modern multicore and manycore architectures: Modelling, optimisation and benchmarking a multiblock CFD code
Published 2016“…We report significant speedups for single thread execution across both kernels: 2-5X on the multicore CPUs and 14-23X on the Xeon Phi coprocessor. Computations at full node and chip concurrency deliver a factor of three speedup on the multicore processors and up to 24X on the Xeon Phi manycore coprocessor.…”
Journal article -
19
Energy Efficiency of a New Parallel PIC Code for Numerical Simulation of Plasma Dynamics in Open Trap
Published 2022-10-01“…This code can be auto-vectorized by the Fortran compiler for Intel Xeon processors with AVX-512 instructions such as Intel Xeon Phi and the highest series of all generations of Intel Xeon Scalable processors. …”
Get full text
Article -
20
NOHOST : a new storage architecture for distributed storage systems
Published 2017Get full text
Thesis