Deep Neural Network Operator Acceleration Library Optimization Based on Domestic Many-core Processor
Operator acceleration libraries based on different hardware devices have become an indispensable part of deep learning framework,which can provide performance improvement for large-scale training or inference tasks dramatically.The current main-stream operator libraries are all developed based on GP...
Main Author: | GAO Jie, LIU Sha, HUANG Ze-qiang, ZHENG Tian-yu, LIU Xin, QI Feng-bin |
---|---|
Format: | Article |
Language: | zho |
Published: |
Editorial office of Computer Science
2022-05-01
|
Series: | Jisuanji kexue |
Subjects: | |
Online Access: | https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2022-49-5-355.pdf |
Similar Items
-
Optimization of the Load Balancing Policy for Tiled Many-Core Processors
by: Ye Liu, et al.
Published: (2019-01-01) -
A Lightweight and High-Throughput Asynchronous Message Bus for Communication in Multi-Core Heterogeneous Systems
by: Qingyang Zeng, et al.
Published: (2024-01-01) -
SunwayImg: A Parallel Image Processing Library for the Sunway Many-Core Processor
by: Rui Liu, et al.
Published: (2019-01-01) -
Trend-Smooth: Accelerate Asynchronous SGD by Smoothing Parameters Using Parameter Trends
by: Guoxin Cui, et al.
Published: (2019-01-01) -
Implementation of Hybrid Alignment Algorithm for Protein Database Search on the SW26010 Many-Core Processor
by: Hao Zhang, et al.
Published: (2019-01-01)