LCRM: Layer-Wise Complexity Reduction Method for CNN Model Optimization on End Devices
The increasing significance of state-of-the-art convolutional neural network (CNN) models in computer vision tasks has led to their widespread use in industry and academia. However, deploying these models in resource-limited environments, such as IoT devices or embedded GPUs, presents challenges due...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2023-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10168115/ |
_version_ | 1797783588297506816 |
---|---|
author | Hanan Hussain P. S. Tamizharasan Praveen Kumar Yadav |
author_facet | Hanan Hussain P. S. Tamizharasan Praveen Kumar Yadav |
author_sort | Hanan Hussain |
collection | DOAJ |
description | The increasing significance of state-of-the-art convolutional neural network (CNN) models in computer vision tasks has led to their widespread use in industry and academia. However, deploying these models in resource-limited environments, such as IoT devices or embedded GPUs, presents challenges due to increased complexities and resource consumption. This research paper proposes an optimization algorithm called Layer-wise Complexity Reduction Method (LCRM) to address these challenges by converting accuracy-focused CNNs into lightweight models. It evaluates the standard convolution layers and replaces them with the most efficient combination of substitutional convolutions based on the output channel size. The primary goal is to reduce the computational complexity of the parent models and the hardware requirements. We assess the effectiveness of our framework by evaluating its performance on various standard CNN models, including AlexNet, VGG-9, U-Net, and Retinex-Net, for different applications such as image classification, optical character recognition, image segmentation, and image enhancement. Our experimental results show up to a 95% reduction in inference latency and up to 93% reduction in energy consumption when deployed on GPU. Furthermore, we compare the LCRM-optimized CNN models with state-of-the-art CNN optimization methods, including pruning, quantization, clustering, and their four cascaded optimization methods, by deploying them on Raspberry Pi-4. The profiling experiments performed on each model demonstrate that the LCRM-optimized CNN models achieve comparable or better accuracy than the parent models while providing added benefits such as a 62.84% reduction in inference latency on end devices with significant memory compression and complexity reductions. |
first_indexed | 2024-03-13T00:28:00Z |
format | Article |
id | doaj.art-554cf1cedb40410fa2ad3b82086d46ab |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-03-13T00:28:00Z |
publishDate | 2023-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-554cf1cedb40410fa2ad3b82086d46ab2023-07-10T23:00:27ZengIEEEIEEE Access2169-35362023-01-0111668386685710.1109/ACCESS.2023.329062010168115LCRM: Layer-Wise Complexity Reduction Method for CNN Model Optimization on End DevicesHanan Hussain0https://orcid.org/0000-0002-4669-4402P. S. Tamizharasan1https://orcid.org/0000-0002-4932-5127Praveen Kumar Yadav2https://orcid.org/0000-0001-8634-1110Department of Computer Science and Engineering, BITS Pilani, Dubai Campus, DIAC, Dubai, United Arab EmiratesDepartment of Computer Science and Engineering, BITS Pilani, Dubai Campus, DIAC, Dubai, United Arab EmiratesAtlastream Pte Ltd., Central Region, SingaporeThe increasing significance of state-of-the-art convolutional neural network (CNN) models in computer vision tasks has led to their widespread use in industry and academia. However, deploying these models in resource-limited environments, such as IoT devices or embedded GPUs, presents challenges due to increased complexities and resource consumption. This research paper proposes an optimization algorithm called Layer-wise Complexity Reduction Method (LCRM) to address these challenges by converting accuracy-focused CNNs into lightweight models. It evaluates the standard convolution layers and replaces them with the most efficient combination of substitutional convolutions based on the output channel size. The primary goal is to reduce the computational complexity of the parent models and the hardware requirements. We assess the effectiveness of our framework by evaluating its performance on various standard CNN models, including AlexNet, VGG-9, U-Net, and Retinex-Net, for different applications such as image classification, optical character recognition, image segmentation, and image enhancement. Our experimental results show up to a 95% reduction in inference latency and up to 93% reduction in energy consumption when deployed on GPU. Furthermore, we compare the LCRM-optimized CNN models with state-of-the-art CNN optimization methods, including pruning, quantization, clustering, and their four cascaded optimization methods, by deploying them on Raspberry Pi-4. The profiling experiments performed on each model demonstrate that the LCRM-optimized CNN models achieve comparable or better accuracy than the parent models while providing added benefits such as a 62.84% reduction in inference latency on end devices with significant memory compression and complexity reductions.https://ieeexplore.ieee.org/document/10168115/Convolution neural networkefficient modelsoptimization techniquesIoT applicationsperformance metricsRaspberry pi |
spellingShingle | Hanan Hussain P. S. Tamizharasan Praveen Kumar Yadav LCRM: Layer-Wise Complexity Reduction Method for CNN Model Optimization on End Devices IEEE Access Convolution neural network efficient models optimization techniques IoT applications performance metrics Raspberry pi |
title | LCRM: Layer-Wise Complexity Reduction Method for CNN Model Optimization on End Devices |
title_full | LCRM: Layer-Wise Complexity Reduction Method for CNN Model Optimization on End Devices |
title_fullStr | LCRM: Layer-Wise Complexity Reduction Method for CNN Model Optimization on End Devices |
title_full_unstemmed | LCRM: Layer-Wise Complexity Reduction Method for CNN Model Optimization on End Devices |
title_short | LCRM: Layer-Wise Complexity Reduction Method for CNN Model Optimization on End Devices |
title_sort | lcrm layer wise complexity reduction method for cnn model optimization on end devices |
topic | Convolution neural network efficient models optimization techniques IoT applications performance metrics Raspberry pi |
url | https://ieeexplore.ieee.org/document/10168115/ |
work_keys_str_mv | AT hananhussain lcrmlayerwisecomplexityreductionmethodforcnnmodeloptimizationonenddevices AT pstamizharasan lcrmlayerwisecomplexityreductionmethodforcnnmodeloptimizationonenddevices AT praveenkumaryadav lcrmlayerwisecomplexityreductionmethodforcnnmodeloptimizationonenddevices |