Investigating the Potential of Network Optimization for a Constrained Object Detection Problem
Object detection models are usually trained and evaluated on highly complicated, challenging academic datasets, which results in deep networks requiring lots of computations. However, a lot of operational use-cases consist of more constrained situations: they have a limited number of classes to be d...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-04-01
|
Series: | Journal of Imaging |
Subjects: | |
Online Access: | https://www.mdpi.com/2313-433X/7/4/64 |
_version_ | 1797539081422372864 |
---|---|
author | Tanguy Ophoff Cédric Gullentops Kristof Van Beeck Toon Goedemé |
author_facet | Tanguy Ophoff Cédric Gullentops Kristof Van Beeck Toon Goedemé |
author_sort | Tanguy Ophoff |
collection | DOAJ |
description | Object detection models are usually trained and evaluated on highly complicated, challenging academic datasets, which results in deep networks requiring lots of computations. However, a lot of operational use-cases consist of more constrained situations: they have a limited number of classes to be detected, less intra-class variance, less lighting and background variance, constrained or even fixed camera viewpoints, etc. In these cases, we hypothesize that smaller networks could be used without deteriorating the accuracy. However, there are multiple reasons why this does not happen in practice. Firstly, overparameterized networks tend to learn better, and secondly, transfer learning is usually used to reduce the necessary amount of training data. In this paper, we investigate how much we can reduce the computational complexity of a standard object detection network in such constrained object detection problems. As a case study, we focus on a well-known single-shot object detector, YoloV2, and combine three different techniques to reduce the computational complexity of the model without reducing its accuracy on our target dataset. To investigate the influence of the problem complexity, we compare two datasets: a prototypical academic (Pascal VOC) and a real-life operational (LWIR person detection) dataset. The three optimization steps we exploited are: swapping all the convolutions for depth-wise separable convolutions, perform pruning and use weight quantization. The results of our case study indeed substantiate our hypothesis that the more constrained a problem is, the more the network can be optimized. On the constrained operational dataset, combining these optimization techniques allowed us to reduce the computational complexity with a factor of 349, as compared to only a factor 9.8 on the academic dataset. When running a benchmark on an Nvidia Jetson AGX Xavier, our fastest model runs more than 15 times faster than the original YoloV2 model, whilst increasing the accuracy by 5% Average Precision (AP). |
first_indexed | 2024-03-10T12:41:11Z |
format | Article |
id | doaj.art-09a51e1df998439781a994af48bdd67f |
institution | Directory Open Access Journal |
issn | 2313-433X |
language | English |
last_indexed | 2024-03-10T12:41:11Z |
publishDate | 2021-04-01 |
publisher | MDPI AG |
record_format | Article |
series | Journal of Imaging |
spelling | doaj.art-09a51e1df998439781a994af48bdd67f2023-11-21T13:51:18ZengMDPI AGJournal of Imaging2313-433X2021-04-01746410.3390/jimaging7040064Investigating the Potential of Network Optimization for a Constrained Object Detection ProblemTanguy Ophoff0Cédric Gullentops1Kristof Van Beeck2Toon Goedemé3EAVISE, PSI, KU Leuven, Jan Pieter De Nayerlaan 5, 2860 Sint-Katelijne-Waver, BelgiumEAVISE, PSI, KU Leuven, Jan Pieter De Nayerlaan 5, 2860 Sint-Katelijne-Waver, BelgiumEAVISE, PSI, KU Leuven, Jan Pieter De Nayerlaan 5, 2860 Sint-Katelijne-Waver, BelgiumEAVISE, PSI, KU Leuven, Jan Pieter De Nayerlaan 5, 2860 Sint-Katelijne-Waver, BelgiumObject detection models are usually trained and evaluated on highly complicated, challenging academic datasets, which results in deep networks requiring lots of computations. However, a lot of operational use-cases consist of more constrained situations: they have a limited number of classes to be detected, less intra-class variance, less lighting and background variance, constrained or even fixed camera viewpoints, etc. In these cases, we hypothesize that smaller networks could be used without deteriorating the accuracy. However, there are multiple reasons why this does not happen in practice. Firstly, overparameterized networks tend to learn better, and secondly, transfer learning is usually used to reduce the necessary amount of training data. In this paper, we investigate how much we can reduce the computational complexity of a standard object detection network in such constrained object detection problems. As a case study, we focus on a well-known single-shot object detector, YoloV2, and combine three different techniques to reduce the computational complexity of the model without reducing its accuracy on our target dataset. To investigate the influence of the problem complexity, we compare two datasets: a prototypical academic (Pascal VOC) and a real-life operational (LWIR person detection) dataset. The three optimization steps we exploited are: swapping all the convolutions for depth-wise separable convolutions, perform pruning and use weight quantization. The results of our case study indeed substantiate our hypothesis that the more constrained a problem is, the more the network can be optimized. On the constrained operational dataset, combining these optimization techniques allowed us to reduce the computational complexity with a factor of 349, as compared to only a factor 9.8 on the academic dataset. When running a benchmark on an Nvidia Jetson AGX Xavier, our fastest model runs more than 15 times faster than the original YoloV2 model, whilst increasing the accuracy by 5% Average Precision (AP).https://www.mdpi.com/2313-433X/7/4/64object detectionsingle-shotembedded devicesmobile convolutionsdepth-wise separable convolutionspruning |
spellingShingle | Tanguy Ophoff Cédric Gullentops Kristof Van Beeck Toon Goedemé Investigating the Potential of Network Optimization for a Constrained Object Detection Problem Journal of Imaging object detection single-shot embedded devices mobile convolutions depth-wise separable convolutions pruning |
title | Investigating the Potential of Network Optimization for a Constrained Object Detection Problem |
title_full | Investigating the Potential of Network Optimization for a Constrained Object Detection Problem |
title_fullStr | Investigating the Potential of Network Optimization for a Constrained Object Detection Problem |
title_full_unstemmed | Investigating the Potential of Network Optimization for a Constrained Object Detection Problem |
title_short | Investigating the Potential of Network Optimization for a Constrained Object Detection Problem |
title_sort | investigating the potential of network optimization for a constrained object detection problem |
topic | object detection single-shot embedded devices mobile convolutions depth-wise separable convolutions pruning |
url | https://www.mdpi.com/2313-433X/7/4/64 |
work_keys_str_mv | AT tanguyophoff investigatingthepotentialofnetworkoptimizationforaconstrainedobjectdetectionproblem AT cedricgullentops investigatingthepotentialofnetworkoptimizationforaconstrainedobjectdetectionproblem AT kristofvanbeeck investigatingthepotentialofnetworkoptimizationforaconstrainedobjectdetectionproblem AT toongoedeme investigatingthepotentialofnetworkoptimizationforaconstrainedobjectdetectionproblem |