Investigating the Potential of Network Optimization for a Constrained Object Detection Problem

Object detection models are usually trained and evaluated on highly complicated, challenging academic datasets, which results in deep networks requiring lots of computations. However, a lot of operational use-cases consist of more constrained situations: they have a limited number of classes to be d...

Full description

Bibliographic Details
Main Authors: Tanguy Ophoff, Cédric Gullentops, Kristof Van Beeck, Toon Goedemé
Format: Article
Language:English
Published: MDPI AG 2021-04-01
Series:Journal of Imaging
Subjects:
Online Access:https://www.mdpi.com/2313-433X/7/4/64
_version_ 1797539081422372864
author Tanguy Ophoff
Cédric Gullentops
Kristof Van Beeck
Toon Goedemé
author_facet Tanguy Ophoff
Cédric Gullentops
Kristof Van Beeck
Toon Goedemé
author_sort Tanguy Ophoff
collection DOAJ
description Object detection models are usually trained and evaluated on highly complicated, challenging academic datasets, which results in deep networks requiring lots of computations. However, a lot of operational use-cases consist of more constrained situations: they have a limited number of classes to be detected, less intra-class variance, less lighting and background variance, constrained or even fixed camera viewpoints, etc. In these cases, we hypothesize that smaller networks could be used without deteriorating the accuracy. However, there are multiple reasons why this does not happen in practice. Firstly, overparameterized networks tend to learn better, and secondly, transfer learning is usually used to reduce the necessary amount of training data. In this paper, we investigate how much we can reduce the computational complexity of a standard object detection network in such constrained object detection problems. As a case study, we focus on a well-known single-shot object detector, YoloV2, and combine three different techniques to reduce the computational complexity of the model without reducing its accuracy on our target dataset. To investigate the influence of the problem complexity, we compare two datasets: a prototypical academic (Pascal VOC) and a real-life operational (LWIR person detection) dataset. The three optimization steps we exploited are: swapping all the convolutions for depth-wise separable convolutions, perform pruning and use weight quantization. The results of our case study indeed substantiate our hypothesis that the more constrained a problem is, the more the network can be optimized. On the constrained operational dataset, combining these optimization techniques allowed us to reduce the computational complexity with a factor of 349, as compared to only a factor 9.8 on the academic dataset. When running a benchmark on an Nvidia Jetson AGX Xavier, our fastest model runs more than 15 times faster than the original YoloV2 model, whilst increasing the accuracy by 5% Average Precision (AP).
first_indexed 2024-03-10T12:41:11Z
format Article
id doaj.art-09a51e1df998439781a994af48bdd67f
institution Directory Open Access Journal
issn 2313-433X
language English
last_indexed 2024-03-10T12:41:11Z
publishDate 2021-04-01
publisher MDPI AG
record_format Article
series Journal of Imaging
spelling doaj.art-09a51e1df998439781a994af48bdd67f2023-11-21T13:51:18ZengMDPI AGJournal of Imaging2313-433X2021-04-01746410.3390/jimaging7040064Investigating the Potential of Network Optimization for a Constrained Object Detection ProblemTanguy Ophoff0Cédric Gullentops1Kristof Van Beeck2Toon Goedemé3EAVISE, PSI, KU Leuven, Jan Pieter De Nayerlaan 5, 2860 Sint-Katelijne-Waver, BelgiumEAVISE, PSI, KU Leuven, Jan Pieter De Nayerlaan 5, 2860 Sint-Katelijne-Waver, BelgiumEAVISE, PSI, KU Leuven, Jan Pieter De Nayerlaan 5, 2860 Sint-Katelijne-Waver, BelgiumEAVISE, PSI, KU Leuven, Jan Pieter De Nayerlaan 5, 2860 Sint-Katelijne-Waver, BelgiumObject detection models are usually trained and evaluated on highly complicated, challenging academic datasets, which results in deep networks requiring lots of computations. However, a lot of operational use-cases consist of more constrained situations: they have a limited number of classes to be detected, less intra-class variance, less lighting and background variance, constrained or even fixed camera viewpoints, etc. In these cases, we hypothesize that smaller networks could be used without deteriorating the accuracy. However, there are multiple reasons why this does not happen in practice. Firstly, overparameterized networks tend to learn better, and secondly, transfer learning is usually used to reduce the necessary amount of training data. In this paper, we investigate how much we can reduce the computational complexity of a standard object detection network in such constrained object detection problems. As a case study, we focus on a well-known single-shot object detector, YoloV2, and combine three different techniques to reduce the computational complexity of the model without reducing its accuracy on our target dataset. To investigate the influence of the problem complexity, we compare two datasets: a prototypical academic (Pascal VOC) and a real-life operational (LWIR person detection) dataset. The three optimization steps we exploited are: swapping all the convolutions for depth-wise separable convolutions, perform pruning and use weight quantization. The results of our case study indeed substantiate our hypothesis that the more constrained a problem is, the more the network can be optimized. On the constrained operational dataset, combining these optimization techniques allowed us to reduce the computational complexity with a factor of 349, as compared to only a factor 9.8 on the academic dataset. When running a benchmark on an Nvidia Jetson AGX Xavier, our fastest model runs more than 15 times faster than the original YoloV2 model, whilst increasing the accuracy by 5% Average Precision (AP).https://www.mdpi.com/2313-433X/7/4/64object detectionsingle-shotembedded devicesmobile convolutionsdepth-wise separable convolutionspruning
spellingShingle Tanguy Ophoff
Cédric Gullentops
Kristof Van Beeck
Toon Goedemé
Investigating the Potential of Network Optimization for a Constrained Object Detection Problem
Journal of Imaging
object detection
single-shot
embedded devices
mobile convolutions
depth-wise separable convolutions
pruning
title Investigating the Potential of Network Optimization for a Constrained Object Detection Problem
title_full Investigating the Potential of Network Optimization for a Constrained Object Detection Problem
title_fullStr Investigating the Potential of Network Optimization for a Constrained Object Detection Problem
title_full_unstemmed Investigating the Potential of Network Optimization for a Constrained Object Detection Problem
title_short Investigating the Potential of Network Optimization for a Constrained Object Detection Problem
title_sort investigating the potential of network optimization for a constrained object detection problem
topic object detection
single-shot
embedded devices
mobile convolutions
depth-wise separable convolutions
pruning
url https://www.mdpi.com/2313-433X/7/4/64
work_keys_str_mv AT tanguyophoff investigatingthepotentialofnetworkoptimizationforaconstrainedobjectdetectionproblem
AT cedricgullentops investigatingthepotentialofnetworkoptimizationforaconstrainedobjectdetectionproblem
AT kristofvanbeeck investigatingthepotentialofnetworkoptimizationforaconstrainedobjectdetectionproblem
AT toongoedeme investigatingthepotentialofnetworkoptimizationforaconstrainedobjectdetectionproblem