Learning Low-Precision Structured Subnetworks Using Joint Layerwise Channel Pruning and Uniform Quantization

Pruning and quantization are core techniques used to reduce the inference costs of deep neural networks. Among the state-of-the-art pruning techniques, magnitude-based pruning algorithms have demonstrated consistent success in the reduction of both weight and feature map complexity. However, we find...

Bibliographic Details
Main Authors:	Xinyu Zhang, Ian Colbert, Srinjoy Das
Format:	Article
Language:	English
Published:	MDPI AG 2022-08-01
Series:	Applied Sciences
Subjects:	channel pruning; layerwise pruning; quantization; joint pruning
Online Access:	https://www.mdpi.com/2076-3417/12/15/7829

Similar Items