Crowd Counting Network Based on Feature Enhancement Loss and Foreground Attention

Crowd counting aims to estimate the total number of people in an image and present its distribution accurately.The images in the relevant datasets usually involve a variety of scenes and include multiple people.To save labor,most datasets usually annotated each human head by a single point.However,t...

Full description

Bibliographic Details
Main Author:	ZHANG Yi, WU Qin
Format:	Article
Language:	zho
Published:	Editorial office of Computer Science 2023-03-01
Series:	Jisuanji kexue
Subjects:	crowd counting\|deep learning\|foreground segmentation\|background compensation\|density estimation
Online Access:	https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2023-50-3-246.pdf

_version_	1797845041078599680
author	ZHANG Yi, WU Qin
author_facet	ZHANG Yi, WU Qin
author_sort	ZHANG Yi, WU Qin
collection	DOAJ
description	Crowd counting aims to estimate the total number of people in an image and present its distribution accurately.The images in the relevant datasets usually involve a variety of scenes and include multiple people.To save labor,most datasets usually annotated each human head by a single point.However,the point labels cannot cover the full human head,which makes it difficult to converge the matching between the crowd feature and the distribution label,and the predicted values cannot be gathered in the foreground region,which seriously affects the density estimation map quality and count accuracy.To solve this problem,count loss is used to constrain the range of predictions on the full map,and a pixel-level distribution consistency loss is used to optimize the density map matching process.In addition,there are many background noises that are easily confused with crowd feature in complex scenes.In order to avoid the interference of false positive predictions on subsequent counting and density map estimation,a foreground segmentation module and feature enhancement loss are proposed to adaptively focus the foreground region and increase the contribution of human head features to the counts,so as to suppress background misjudgments.In addition,in order to make the network adapt to the multi-scale pattern of the human head better,up and down sampling operations are performed on each image to be trained to obtain the multi-scale pattern with the same object.Experiments on several datasets show that the proposed method achieves better or competitive results compared with state-of-the-art methods.
first_indexed	2024-04-09T17:33:12Z
format	Article
id	doaj.art-0bd3db80c583425681d818d0fb2cb01e
institution	Directory Open Access Journal
issn	1002-137X
language	zho
last_indexed	2024-04-09T17:33:12Z
publishDate	2023-03-01
publisher	Editorial office of Computer Science
record_format	Article
series	Jisuanji kexue
spelling	doaj.art-0bd3db80c583425681d818d0fb2cb01e2023-04-18T02:33:25ZzhoEditorial office of Computer ScienceJisuanji kexue1002-137X2023-03-0150324625310.11896/jsjkx.220100219Crowd Counting Network Based on Feature Enhancement Loss and Foreground AttentionZHANG Yi, WU Qin01 School of Artificial Intelligence and Computer Science,Jiangnan University,Wuxi,Jiangsu 214122,China;2 Jiangsu Provincial Engineering Laboratory of Pattern Recognition and Computational Intelligence,Jiangnan University,Wuxi,Jiangsu 214122, ChinaCrowd counting aims to estimate the total number of people in an image and present its distribution accurately.The images in the relevant datasets usually involve a variety of scenes and include multiple people.To save labor,most datasets usually annotated each human head by a single point.However,the point labels cannot cover the full human head,which makes it difficult to converge the matching between the crowd feature and the distribution label,and the predicted values cannot be gathered in the foreground region,which seriously affects the density estimation map quality and count accuracy.To solve this problem,count loss is used to constrain the range of predictions on the full map,and a pixel-level distribution consistency loss is used to optimize the density map matching process.In addition,there are many background noises that are easily confused with crowd feature in complex scenes.In order to avoid the interference of false positive predictions on subsequent counting and density map estimation,a foreground segmentation module and feature enhancement loss are proposed to adaptively focus the foreground region and increase the contribution of human head features to the counts,so as to suppress background misjudgments.In addition,in order to make the network adapt to the multi-scale pattern of the human head better,up and down sampling operations are performed on each image to be trained to obtain the multi-scale pattern with the same object.Experiments on several datasets show that the proposed method achieves better or competitive results compared with state-of-the-art methods.https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2023-50-3-246.pdfcrowd counting\|deep learning\|foreground segmentation\|background compensation\|density estimation
spellingShingle	ZHANG Yi, WU Qin Crowd Counting Network Based on Feature Enhancement Loss and Foreground Attention Jisuanji kexue crowd counting\|deep learning\|foreground segmentation\|background compensation\|density estimation
title	Crowd Counting Network Based on Feature Enhancement Loss and Foreground Attention
title_full	Crowd Counting Network Based on Feature Enhancement Loss and Foreground Attention
title_fullStr	Crowd Counting Network Based on Feature Enhancement Loss and Foreground Attention
title_full_unstemmed	Crowd Counting Network Based on Feature Enhancement Loss and Foreground Attention
title_short	Crowd Counting Network Based on Feature Enhancement Loss and Foreground Attention
title_sort	crowd counting network based on feature enhancement loss and foreground attention
topic	crowd counting\|deep learning\|foreground segmentation\|background compensation\|density estimation
url	https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2023-50-3-246.pdf
work_keys_str_mv	AT zhangyiwuqin crowdcountingnetworkbasedonfeatureenhancementlossandforegroundattention

Crowd Counting Network Based on Feature Enhancement Loss and Foreground Attention

Similar Items