Global Multi-Scale Information Fusion for Multi-Class Object Counting in Remote Sensing Images

In recent years, object counting has been investigated and has made significant progress under a surveillance-view. However, there exists only a few works focusing on the remote sensing object density estimation, and the performance of existing methods is not promising. On the one hand, due to the i...

Full description

Bibliographic Details
Main Authors:	Junyu Gao, Maoguo Gong, Xuelong Li
Format:	Article
Language:	English
Published:	MDPI AG 2022-08-01
Series:	Remote Sensing
Subjects:	couting remote counting transformer remote sensing image
Online Access:	https://www.mdpi.com/2072-4292/14/16/4026

_version_	1797442215955398656
author	Junyu Gao Maoguo Gong Xuelong Li
author_facet	Junyu Gao Maoguo Gong Xuelong Li
author_sort	Junyu Gao
collection	DOAJ
description	In recent years, object counting has been investigated and has made significant progress under a surveillance-view. However, there exists only a few works focusing on the remote sensing object density estimation, and the performance of existing methods is not promising. On the one hand, due to the imbalance distribution of targets in remote sensing images, the model might collapse, leading a severe degradation. On the other hand, the scale of targets in remote sensing images actually varies in real scenarios, which remains a challenge for counting objects accurately. To remedy the above problems, we propose an approach named “SwinCounter” for object counting in remote sensing. Moreover, we introduce a Balanced MSE Loss to pay more attention to the fewer samples, which alleviates the problem of imbalanced object labels. In addition, the attention mechanism in our SwinCounter can precisely capture multi-scale information. Thus, the model is aware of different scales of objects, which capture small and dense targetes more precisely. We build experiments on the RSOC dataset, achieving MAEs of 7.2, 151.5, 14.38, and 52.88 and MSEs of 10.1, 436.0, 22.7, and 74.82 on the Building, Small-Vehicle, Large-Vehicle, and Ship sub-datasets, which demonstrates the competitiveness and superiority of the proposed method.
first_indexed	2024-03-09T12:39:36Z
format	Article
id	doaj.art-2787c0f113cc49cf9fa015479f282b0e
institution	Directory Open Access Journal
issn	2072-4292
language	English
last_indexed	2024-03-09T12:39:36Z
publishDate	2022-08-01
publisher	MDPI AG
record_format	Article
series	Remote Sensing
spelling	doaj.art-2787c0f113cc49cf9fa015479f282b0e2023-11-30T22:20:09ZengMDPI AGRemote Sensing2072-42922022-08-011416402610.3390/rs14164026Global Multi-Scale Information Fusion for Multi-Class Object Counting in Remote Sensing ImagesJunyu Gao0Maoguo Gong1Xuelong Li2Academy of Advanced Interdisciplinary Research, Xidian University, Xi’an 710071, ChinaKey Laboratory of Intelligent Perception and Image Understanding of Ministry of Education, International Research Center for Intelligent Perception and Computation, Xidian University, Xi’an 710071, ChinaSchool of Artificial Intelligence, Optics and Electronics (iOPEN), Northwestern Polytechnical University, Xi’an 710072, ChinaIn recent years, object counting has been investigated and has made significant progress under a surveillance-view. However, there exists only a few works focusing on the remote sensing object density estimation, and the performance of existing methods is not promising. On the one hand, due to the imbalance distribution of targets in remote sensing images, the model might collapse, leading a severe degradation. On the other hand, the scale of targets in remote sensing images actually varies in real scenarios, which remains a challenge for counting objects accurately. To remedy the above problems, we propose an approach named “SwinCounter” for object counting in remote sensing. Moreover, we introduce a Balanced MSE Loss to pay more attention to the fewer samples, which alleviates the problem of imbalanced object labels. In addition, the attention mechanism in our SwinCounter can precisely capture multi-scale information. Thus, the model is aware of different scales of objects, which capture small and dense targetes more precisely. We build experiments on the RSOC dataset, achieving MAEs of 7.2, 151.5, 14.38, and 52.88 and MSEs of 10.1, 436.0, 22.7, and 74.82 on the Building, Small-Vehicle, Large-Vehicle, and Ship sub-datasets, which demonstrates the competitiveness and superiority of the proposed method.https://www.mdpi.com/2072-4292/14/16/4026coutingremote countingtransformerremote sensing image
spellingShingle	Junyu Gao Maoguo Gong Xuelong Li Global Multi-Scale Information Fusion for Multi-Class Object Counting in Remote Sensing Images Remote Sensing couting remote counting transformer remote sensing image
title	Global Multi-Scale Information Fusion for Multi-Class Object Counting in Remote Sensing Images
title_full	Global Multi-Scale Information Fusion for Multi-Class Object Counting in Remote Sensing Images
title_fullStr	Global Multi-Scale Information Fusion for Multi-Class Object Counting in Remote Sensing Images
title_full_unstemmed	Global Multi-Scale Information Fusion for Multi-Class Object Counting in Remote Sensing Images
title_short	Global Multi-Scale Information Fusion for Multi-Class Object Counting in Remote Sensing Images
title_sort	global multi scale information fusion for multi class object counting in remote sensing images
topic	couting remote counting transformer remote sensing image
url	https://www.mdpi.com/2072-4292/14/16/4026
work_keys_str_mv	AT junyugao globalmultiscaleinformationfusionformulticlassobjectcountinginremotesensingimages AT maoguogong globalmultiscaleinformationfusionformulticlassobjectcountinginremotesensingimages AT xuelongli globalmultiscaleinformationfusionformulticlassobjectcountinginremotesensingimages

Global Multi-Scale Information Fusion for Multi-Class Object Counting in Remote Sensing Images

Similar Items