High performance 2D convolution utilizing the AVX512 on a multi-core architecture

Convolution is a time consuming operation, especially for signal and image processing, which led us to develop an efficient implementation of 2D convolution for a multi-core architecture utilizing AVX512 intrinsics and OpenMP. For single precision convolution, our algorithm is on average 2.30, 3.8...

Full description

Bibliographic Details
Main Authors: Isamail Masamae, Panyayot Chaikan
Format: Article
Language:English
Published: Prince of Songkla University 2021-08-01
Series:Songklanakarin Journal of Science and Technology (SJST)
Subjects:
Online Access:https://rdo.psu.ac.th/sjst/journal/43-4/40.pdf