Multi-Scale Person Localization With Multi-Stage Deep Sequential Framework

Person detection in real videos and images is a classical research problem in computer vision. Person detection is a nontrivial problem that offers many challenges due to several nuisances that commonly observed in natural videos. Among these, scale is the main challenging problem in various object...

Full description

Bibliographic Details
Main Authors: Sultan Daud Khan, Saleh Basalamah
Format: Article
Language:English
Published: Springer 2021-03-01
Series:International Journal of Computational Intelligence Systems
Subjects:
Online Access:https://www.atlantis-press.com/article/125954999/view
_version_ 1811301946480394240
author Sultan Daud Khan
Saleh Basalamah
author_facet Sultan Daud Khan
Saleh Basalamah
author_sort Sultan Daud Khan
collection DOAJ
description Person detection in real videos and images is a classical research problem in computer vision. Person detection is a nontrivial problem that offers many challenges due to several nuisances that commonly observed in natural videos. Among these, scale is the main challenging problem in various object detection tasks. To solve the scale problem, we propose a framework that estimates the scales of person’s heads, as we argue that head is the only visible part in complex scenes. we propose a head detection framework that explicitly handles head scales. The framework consists of two sequential networks: (1) scale estimation network (SENet) and (2) head detection network. SENet predicts the distribution of scales from the input image in the form of histogram. Then the scale histogram adjust anchor scale set of region proposal network that generates object proposals. These objects proposals are then classified into two classes, that is, head and background by the detection network. We evaluate proposed framework on three challenging benchmark datasets. Experiment results show that proposed framework achieves state-of-the-art performance.
first_indexed 2024-04-13T07:18:06Z
format Article
id doaj.art-9627b1a6488d4927b71b14f5d6f4ef5f
institution Directory Open Access Journal
issn 1875-6883
language English
last_indexed 2024-04-13T07:18:06Z
publishDate 2021-03-01
publisher Springer
record_format Article
series International Journal of Computational Intelligence Systems
spelling doaj.art-9627b1a6488d4927b71b14f5d6f4ef5f2022-12-22T02:56:41ZengSpringerInternational Journal of Computational Intelligence Systems1875-68832021-03-0114110.2991/ijcis.d.210326.001Multi-Scale Person Localization With Multi-Stage Deep Sequential FrameworkSultan Daud KhanSaleh BasalamahPerson detection in real videos and images is a classical research problem in computer vision. Person detection is a nontrivial problem that offers many challenges due to several nuisances that commonly observed in natural videos. Among these, scale is the main challenging problem in various object detection tasks. To solve the scale problem, we propose a framework that estimates the scales of person’s heads, as we argue that head is the only visible part in complex scenes. we propose a head detection framework that explicitly handles head scales. The framework consists of two sequential networks: (1) scale estimation network (SENet) and (2) head detection network. SENet predicts the distribution of scales from the input image in the form of histogram. Then the scale histogram adjust anchor scale set of region proposal network that generates object proposals. These objects proposals are then classified into two classes, that is, head and background by the detection network. We evaluate proposed framework on three challenging benchmark datasets. Experiment results show that proposed framework achieves state-of-the-art performance.https://www.atlantis-press.com/article/125954999/viewScale estimationDeep learningHead detectionCrowd analysis
spellingShingle Sultan Daud Khan
Saleh Basalamah
Multi-Scale Person Localization With Multi-Stage Deep Sequential Framework
International Journal of Computational Intelligence Systems
Scale estimation
Deep learning
Head detection
Crowd analysis
title Multi-Scale Person Localization With Multi-Stage Deep Sequential Framework
title_full Multi-Scale Person Localization With Multi-Stage Deep Sequential Framework
title_fullStr Multi-Scale Person Localization With Multi-Stage Deep Sequential Framework
title_full_unstemmed Multi-Scale Person Localization With Multi-Stage Deep Sequential Framework
title_short Multi-Scale Person Localization With Multi-Stage Deep Sequential Framework
title_sort multi scale person localization with multi stage deep sequential framework
topic Scale estimation
Deep learning
Head detection
Crowd analysis
url https://www.atlantis-press.com/article/125954999/view
work_keys_str_mv AT sultandaudkhan multiscalepersonlocalizationwithmultistagedeepsequentialframework
AT salehbasalamah multiscalepersonlocalizationwithmultistagedeepsequentialframework