Pedestrian Detection by Novel Axis-Line Representation and Regression Pattern

The pattern of bounding box representation and regression has long been dominant in CNN-based pedestrian detectors. Despite the method’s success, it cannot accurately represent location, and introduces unnecessary background information, while pedestrian features are mainly located in axis-line area...

Full description

Bibliographic Details
Main Authors: Mengxue Zhang, Qiong Liu
Format: Article
Language:English
Published: MDPI AG 2021-05-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/21/10/3312
_version_ 1797534685427924992
author Mengxue Zhang
Qiong Liu
author_facet Mengxue Zhang
Qiong Liu
author_sort Mengxue Zhang
collection DOAJ
description The pattern of bounding box representation and regression has long been dominant in CNN-based pedestrian detectors. Despite the method’s success, it cannot accurately represent location, and introduces unnecessary background information, while pedestrian features are mainly located in axis-line areas. Other object representations, such as corner-pairs, are not easy to obtain by regression because the corners are far from the axis-line and are greatly affected by background features. In this paper, we propose a novel detection pattern, named Axis-line Representation and Regression (ALR), for pedestrian detection in road scenes. Specifically, we design a 3-d axis-line representation for pedestrians and use it as the regression target during network training. A line-box transformation method is also proposed to fit the widely used box-annotations. Meanwhile, we explore the influence of deformable convolution base-offset on detection performance and propose a base-offset initialization strategy to further promote the gain brought by ALR. Notably, the proposed ALR pattern can be introduced into both anchor-based and anchor-free frameworks. We validate the effectiveness of ALR on the Caltech-USA and CityPersons datasets. Experimental results show that our approach outperforms the baseline significantly through simple modifications and achieves competitive accuracy with other methods without bells and whistles.
first_indexed 2024-03-10T11:32:58Z
format Article
id doaj.art-aa541a1bfa7b4b958b2d3d0226aa7d12
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-10T11:32:58Z
publishDate 2021-05-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-aa541a1bfa7b4b958b2d3d0226aa7d122023-11-21T19:06:42ZengMDPI AGSensors1424-82202021-05-012110331210.3390/s21103312Pedestrian Detection by Novel Axis-Line Representation and Regression PatternMengxue Zhang0Qiong Liu1School of Software Engineering, South China University of Technology, Guangzhou 510006, ChinaSchool of Software Engineering, South China University of Technology, Guangzhou 510006, ChinaThe pattern of bounding box representation and regression has long been dominant in CNN-based pedestrian detectors. Despite the method’s success, it cannot accurately represent location, and introduces unnecessary background information, while pedestrian features are mainly located in axis-line areas. Other object representations, such as corner-pairs, are not easy to obtain by regression because the corners are far from the axis-line and are greatly affected by background features. In this paper, we propose a novel detection pattern, named Axis-line Representation and Regression (ALR), for pedestrian detection in road scenes. Specifically, we design a 3-d axis-line representation for pedestrians and use it as the regression target during network training. A line-box transformation method is also proposed to fit the widely used box-annotations. Meanwhile, we explore the influence of deformable convolution base-offset on detection performance and propose a base-offset initialization strategy to further promote the gain brought by ALR. Notably, the proposed ALR pattern can be introduced into both anchor-based and anchor-free frameworks. We validate the effectiveness of ALR on the Caltech-USA and CityPersons datasets. Experimental results show that our approach outperforms the baseline significantly through simple modifications and achieves competitive accuracy with other methods without bells and whistles.https://www.mdpi.com/1424-8220/21/10/3312pedestrian detectionobject representationaxis lineroad scene
spellingShingle Mengxue Zhang
Qiong Liu
Pedestrian Detection by Novel Axis-Line Representation and Regression Pattern
Sensors
pedestrian detection
object representation
axis line
road scene
title Pedestrian Detection by Novel Axis-Line Representation and Regression Pattern
title_full Pedestrian Detection by Novel Axis-Line Representation and Regression Pattern
title_fullStr Pedestrian Detection by Novel Axis-Line Representation and Regression Pattern
title_full_unstemmed Pedestrian Detection by Novel Axis-Line Representation and Regression Pattern
title_short Pedestrian Detection by Novel Axis-Line Representation and Regression Pattern
title_sort pedestrian detection by novel axis line representation and regression pattern
topic pedestrian detection
object representation
axis line
road scene
url https://www.mdpi.com/1424-8220/21/10/3312
work_keys_str_mv AT mengxuezhang pedestriandetectionbynovelaxislinerepresentationandregressionpattern
AT qiongliu pedestriandetectionbynovelaxislinerepresentationandregressionpattern