LCRM: Layer-Wise Complexity Reduction Method for CNN Model Optimization on End Devices

The increasing significance of state-of-the-art convolutional neural network (CNN) models in computer vision tasks has led to their widespread use in industry and academia. However, deploying these models in resource-limited environments, such as IoT devices or embedded GPUs, presents challenges due...

Full description

Bibliographic Details
Main Authors: Hanan Hussain, P. S. Tamizharasan, Praveen Kumar Yadav
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10168115/
_version_ 1797783588297506816
author Hanan Hussain
P. S. Tamizharasan
Praveen Kumar Yadav
author_facet Hanan Hussain
P. S. Tamizharasan
Praveen Kumar Yadav
author_sort Hanan Hussain
collection DOAJ
description The increasing significance of state-of-the-art convolutional neural network (CNN) models in computer vision tasks has led to their widespread use in industry and academia. However, deploying these models in resource-limited environments, such as IoT devices or embedded GPUs, presents challenges due to increased complexities and resource consumption. This research paper proposes an optimization algorithm called Layer-wise Complexity Reduction Method (LCRM) to address these challenges by converting accuracy-focused CNNs into lightweight models. It evaluates the standard convolution layers and replaces them with the most efficient combination of substitutional convolutions based on the output channel size. The primary goal is to reduce the computational complexity of the parent models and the hardware requirements. We assess the effectiveness of our framework by evaluating its performance on various standard CNN models, including AlexNet, VGG-9, U-Net, and Retinex-Net, for different applications such as image classification, optical character recognition, image segmentation, and image enhancement. Our experimental results show up to a 95% reduction in inference latency and up to 93% reduction in energy consumption when deployed on GPU. Furthermore, we compare the LCRM-optimized CNN models with state-of-the-art CNN optimization methods, including pruning, quantization, clustering, and their four cascaded optimization methods, by deploying them on Raspberry Pi-4. The profiling experiments performed on each model demonstrate that the LCRM-optimized CNN models achieve comparable or better accuracy than the parent models while providing added benefits such as a 62.84% reduction in inference latency on end devices with significant memory compression and complexity reductions.
first_indexed 2024-03-13T00:28:00Z
format Article
id doaj.art-554cf1cedb40410fa2ad3b82086d46ab
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-13T00:28:00Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-554cf1cedb40410fa2ad3b82086d46ab2023-07-10T23:00:27ZengIEEEIEEE Access2169-35362023-01-0111668386685710.1109/ACCESS.2023.329062010168115LCRM: Layer-Wise Complexity Reduction Method for CNN Model Optimization on End DevicesHanan Hussain0https://orcid.org/0000-0002-4669-4402P. S. Tamizharasan1https://orcid.org/0000-0002-4932-5127Praveen Kumar Yadav2https://orcid.org/0000-0001-8634-1110Department of Computer Science and Engineering, BITS Pilani, Dubai Campus, DIAC, Dubai, United Arab EmiratesDepartment of Computer Science and Engineering, BITS Pilani, Dubai Campus, DIAC, Dubai, United Arab EmiratesAtlastream Pte Ltd., Central Region, SingaporeThe increasing significance of state-of-the-art convolutional neural network (CNN) models in computer vision tasks has led to their widespread use in industry and academia. However, deploying these models in resource-limited environments, such as IoT devices or embedded GPUs, presents challenges due to increased complexities and resource consumption. This research paper proposes an optimization algorithm called Layer-wise Complexity Reduction Method (LCRM) to address these challenges by converting accuracy-focused CNNs into lightweight models. It evaluates the standard convolution layers and replaces them with the most efficient combination of substitutional convolutions based on the output channel size. The primary goal is to reduce the computational complexity of the parent models and the hardware requirements. We assess the effectiveness of our framework by evaluating its performance on various standard CNN models, including AlexNet, VGG-9, U-Net, and Retinex-Net, for different applications such as image classification, optical character recognition, image segmentation, and image enhancement. Our experimental results show up to a 95% reduction in inference latency and up to 93% reduction in energy consumption when deployed on GPU. Furthermore, we compare the LCRM-optimized CNN models with state-of-the-art CNN optimization methods, including pruning, quantization, clustering, and their four cascaded optimization methods, by deploying them on Raspberry Pi-4. The profiling experiments performed on each model demonstrate that the LCRM-optimized CNN models achieve comparable or better accuracy than the parent models while providing added benefits such as a 62.84% reduction in inference latency on end devices with significant memory compression and complexity reductions.https://ieeexplore.ieee.org/document/10168115/Convolution neural networkefficient modelsoptimization techniquesIoT applicationsperformance metricsRaspberry pi
spellingShingle Hanan Hussain
P. S. Tamizharasan
Praveen Kumar Yadav
LCRM: Layer-Wise Complexity Reduction Method for CNN Model Optimization on End Devices
IEEE Access
Convolution neural network
efficient models
optimization techniques
IoT applications
performance metrics
Raspberry pi
title LCRM: Layer-Wise Complexity Reduction Method for CNN Model Optimization on End Devices
title_full LCRM: Layer-Wise Complexity Reduction Method for CNN Model Optimization on End Devices
title_fullStr LCRM: Layer-Wise Complexity Reduction Method for CNN Model Optimization on End Devices
title_full_unstemmed LCRM: Layer-Wise Complexity Reduction Method for CNN Model Optimization on End Devices
title_short LCRM: Layer-Wise Complexity Reduction Method for CNN Model Optimization on End Devices
title_sort lcrm layer wise complexity reduction method for cnn model optimization on end devices
topic Convolution neural network
efficient models
optimization techniques
IoT applications
performance metrics
Raspberry pi
url https://ieeexplore.ieee.org/document/10168115/
work_keys_str_mv AT hananhussain lcrmlayerwisecomplexityreductionmethodforcnnmodeloptimizationonenddevices
AT pstamizharasan lcrmlayerwisecomplexityreductionmethodforcnnmodeloptimizationonenddevices
AT praveenkumaryadav lcrmlayerwisecomplexityreductionmethodforcnnmodeloptimizationonenddevices