High performance 2D convolution utilizing the AVX512 on a multi-core architecture
Convolution is a time consuming operation, especially for signal and image processing, which led us to develop an efficient implementation of 2D convolution for a multi-core architecture utilizing AVX512 intrinsics and OpenMP. For single precision convolution, our algorithm is on average 2.30, 3.8...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Prince of Songkla University
2021-08-01
|
Series: | Songklanakarin Journal of Science and Technology (SJST) |
Subjects: | |
Online Access: | https://rdo.psu.ac.th/sjst/journal/43-4/40.pdf |