Investigating the Potential of Network Optimization for a Constrained Object Detection Problem

Object detection models are usually trained and evaluated on highly complicated, challenging academic datasets, which results in deep networks requiring lots of computations. However, a lot of operational use-cases consist of more constrained situations: they have a limited number of classes to be d...

Full description

Bibliographic Details
Main Authors:	Tanguy Ophoff, Cédric Gullentops, Kristof Van Beeck, Toon Goedemé
Format:	Article
Language:	English
Published:	MDPI AG 2021-04-01
Series:	Journal of Imaging
Subjects:	object detection single-shot embedded devices mobile convolutions depth-wise separable convolutions pruning
Online Access:	https://www.mdpi.com/2313-433X/7/4/64

_version_	1797539081422372864
author	Tanguy Ophoff Cédric Gullentops Kristof Van Beeck Toon Goedemé
author_facet	Tanguy Ophoff Cédric Gullentops Kristof Van Beeck Toon Goedemé
author_sort	Tanguy Ophoff
collection	DOAJ
description	Object detection models are usually trained and evaluated on highly complicated, challenging academic datasets, which results in deep networks requiring lots of computations. However, a lot of operational use-cases consist of more constrained situations: they have a limited number of classes to be detected, less intra-class variance, less lighting and background variance, constrained or even fixed camera viewpoints, etc. In these cases, we hypothesize that smaller networks could be used without deteriorating the accuracy. However, there are multiple reasons why this does not happen in practice. Firstly, overparameterized networks tend to learn better, and secondly, transfer learning is usually used to reduce the necessary amount of training data. In this paper, we investigate how much we can reduce the computational complexity of a standard object detection network in such constrained object detection problems. As a case study, we focus on a well-known single-shot object detector, YoloV2, and combine three different techniques to reduce the computational complexity of the model without reducing its accuracy on our target dataset. To investigate the influence of the problem complexity, we compare two datasets: a prototypical academic (Pascal VOC) and a real-life operational (LWIR person detection) dataset. The three optimization steps we exploited are: swapping all the convolutions for depth-wise separable convolutions, perform pruning and use weight quantization. The results of our case study indeed substantiate our hypothesis that the more constrained a problem is, the more the network can be optimized. On the constrained operational dataset, combining these optimization techniques allowed us to reduce the computational complexity with a factor of 349, as compared to only a factor 9.8 on the academic dataset. When running a benchmark on an Nvidia Jetson AGX Xavier, our fastest model runs more than 15 times faster than the original YoloV2 model, whilst increasing the accuracy by 5% Average Precision (AP).
first_indexed	2024-03-10T12:41:11Z
format	Article
id	doaj.art-09a51e1df998439781a994af48bdd67f
institution	Directory Open Access Journal
issn	2313-433X
language	English
last_indexed	2024-03-10T12:41:11Z
publishDate	2021-04-01
publisher	MDPI AG
record_format	Article
series	Journal of Imaging
spelling	doaj.art-09a51e1df998439781a994af48bdd67f2023-11-21T13:51:18ZengMDPI AGJournal of Imaging2313-433X2021-04-01746410.3390/jimaging7040064Investigating the Potential of Network Optimization for a Constrained Object Detection ProblemTanguy Ophoff0Cédric Gullentops1Kristof Van Beeck2Toon Goedemé3EAVISE, PSI, KU Leuven, Jan Pieter De Nayerlaan 5, 2860 Sint-Katelijne-Waver, BelgiumEAVISE, PSI, KU Leuven, Jan Pieter De Nayerlaan 5, 2860 Sint-Katelijne-Waver, BelgiumEAVISE, PSI, KU Leuven, Jan Pieter De Nayerlaan 5, 2860 Sint-Katelijne-Waver, BelgiumEAVISE, PSI, KU Leuven, Jan Pieter De Nayerlaan 5, 2860 Sint-Katelijne-Waver, BelgiumObject detection models are usually trained and evaluated on highly complicated, challenging academic datasets, which results in deep networks requiring lots of computations. However, a lot of operational use-cases consist of more constrained situations: they have a limited number of classes to be detected, less intra-class variance, less lighting and background variance, constrained or even fixed camera viewpoints, etc. In these cases, we hypothesize that smaller networks could be used without deteriorating the accuracy. However, there are multiple reasons why this does not happen in practice. Firstly, overparameterized networks tend to learn better, and secondly, transfer learning is usually used to reduce the necessary amount of training data. In this paper, we investigate how much we can reduce the computational complexity of a standard object detection network in such constrained object detection problems. As a case study, we focus on a well-known single-shot object detector, YoloV2, and combine three different techniques to reduce the computational complexity of the model without reducing its accuracy on our target dataset. To investigate the influence of the problem complexity, we compare two datasets: a prototypical academic (Pascal VOC) and a real-life operational (LWIR person detection) dataset. The three optimization steps we exploited are: swapping all the convolutions for depth-wise separable convolutions, perform pruning and use weight quantization. The results of our case study indeed substantiate our hypothesis that the more constrained a problem is, the more the network can be optimized. On the constrained operational dataset, combining these optimization techniques allowed us to reduce the computational complexity with a factor of 349, as compared to only a factor 9.8 on the academic dataset. When running a benchmark on an Nvidia Jetson AGX Xavier, our fastest model runs more than 15 times faster than the original YoloV2 model, whilst increasing the accuracy by 5% Average Precision (AP).https://www.mdpi.com/2313-433X/7/4/64object detectionsingle-shotembedded devicesmobile convolutionsdepth-wise separable convolutionspruning
spellingShingle	Tanguy Ophoff Cédric Gullentops Kristof Van Beeck Toon Goedemé Investigating the Potential of Network Optimization for a Constrained Object Detection Problem Journal of Imaging object detection single-shot embedded devices mobile convolutions depth-wise separable convolutions pruning
title	Investigating the Potential of Network Optimization for a Constrained Object Detection Problem
title_full	Investigating the Potential of Network Optimization for a Constrained Object Detection Problem
title_fullStr	Investigating the Potential of Network Optimization for a Constrained Object Detection Problem
title_full_unstemmed	Investigating the Potential of Network Optimization for a Constrained Object Detection Problem
title_short	Investigating the Potential of Network Optimization for a Constrained Object Detection Problem
title_sort	investigating the potential of network optimization for a constrained object detection problem
topic	object detection single-shot embedded devices mobile convolutions depth-wise separable convolutions pruning
url	https://www.mdpi.com/2313-433X/7/4/64
work_keys_str_mv	AT tanguyophoff investigatingthepotentialofnetworkoptimizationforaconstrainedobjectdetectionproblem AT cedricgullentops investigatingthepotentialofnetworkoptimizationforaconstrainedobjectdetectionproblem AT kristofvanbeeck investigatingthepotentialofnetworkoptimizationforaconstrainedobjectdetectionproblem AT toongoedeme investigatingthepotentialofnetworkoptimizationforaconstrainedobjectdetectionproblem

Investigating the Potential of Network Optimization for a Constrained Object Detection Problem

Similar Items