Summary: Convolutional neural networks (CNNs) are popular models that are widely used in image classification, target recognition, and other fields. Model compression is a common step in deploying neural networks on embedded devices, and it typically relies on a retraining stage: the compressed weights must be retrained to compensate for the loss of precision, which is time-consuming. Unlike prior designs, we propose a novel model compression approach based on Simon k-means, which is specifically designed to support a hardware acceleration scheme. First, we propose an extension of simple k-means, named Simon k-means, and use it to cluster the trained weights of the convolutional and fully connected layers. Second, we reduce the hardware resources consumed by data movement and storage through a data storage and indexing scheme. Finally, we present a hardware implementation of the compressed CNN accelerator. Our evaluations on several classification tasks show that our design achieves 5.27× compression and reduces the number of multiply-accumulate (MAC) operations in AlexNet by 74.3% on the Fashion-MNIST dataset.
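As a rough sketch of the weight-clustering idea behind this kind of compression, the snippet below clusters a layer's weights with plain k-means and replaces each weight by a small index into a shared codebook. This uses standard k-means as a stand-in, since the Simon k-means extension is not detailed in this summary, and all names, shapes, and parameters here are hypothetical.

```python
# Minimal sketch of k-means weight clustering for model compression.
# NOT the paper's Simon k-means; a plain 1-D k-means used for illustration.
import numpy as np

def kmeans_1d(weights, k, iters=50):
    """Cluster a flat weight vector into k centroids (plain k-means)."""
    w = weights.ravel()
    # Initialize centroids evenly across the weight range.
    centroids = np.linspace(w.min(), w.max(), k)
    for _ in range(iters):
        # Assign each weight to its nearest centroid.
        idx = np.argmin(np.abs(w[:, None] - centroids[None, :]), axis=1)
        # Move each centroid to the mean of its assigned weights.
        for j in range(k):
            if np.any(idx == j):
                centroids[j] = w[idx == j].mean()
    return centroids, idx

# Hypothetical conv-layer weights, clustered into k = 16 levels so that
# each weight can be stored as a 4-bit codebook index.
rng = np.random.default_rng(0)
layer = rng.normal(size=(64, 3, 3, 3)).astype(np.float32)
codebook, indices = kmeans_1d(layer, k=16)

# Storage becomes 4-bit indices plus a small float codebook instead of
# one 32-bit float per weight -- roughly 8x fewer bits per weight,
# ignoring the codebook overhead.
reconstructed = codebook[indices].reshape(layer.shape)
print("mean abs quantization error:", np.abs(layer - reconstructed).mean())
```

On hardware, a layout like this is what makes the index-based storage attractive: the accelerator fetches narrow indices from memory and looks up full-precision values in a small on-chip codebook, which is consistent with the data storage and indexing scheme the summary describes.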