CAGNet: coordinated attention guidance network for RGB-T crowd counting

Estimating crowd density is a demanding task that has garnered significant research attention in urban planning, intelligent transportation, and other related fields. This study utilizes RGB and thermal images to leverage multimodal information and introduces a coordinated attention guidance network...

Full description

Bibliographic Details
Main Authors: Yang, Xun, Zhou, Wujie, Yan, Weiqing, Qian, Xiaohong
Other Authors: School of Computer Science and Engineering
Format: Journal Article
Language:English
Published: 2024
Subjects:
Online Access:https://hdl.handle.net/10356/180135
_version_ 1826117239332405248
author Yang, Xun
Zhou, Wujie
Yan, Weiqing
Qian, Xiaohong
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Yang, Xun
Zhou, Wujie
Yan, Weiqing
Qian, Xiaohong
author_sort Yang, Xun
collection NTU
description Estimating crowd density is a demanding task that has garnered significant research attention in urban planning, intelligent transportation, and other related fields. This study utilizes RGB and thermal images to leverage multimodal information and introduces a coordinated attention guidance network (CAGNet) for RGB-thermal (RGB-T) crowd counting. The framework enhances the expressive capabilities of backbone features by incorporating overall and local relationships through the information enhancement unit module, which utilizes the context coordination perception module for horizontal information mining, interactively compensating for the continuity of spatial information. Subsequently, it utilizes diverse multi-level information features for hierarchical intersection and progression, resulting in an accurate crowd counting density map. Experimental results on the RGBT-CC benchmark dataset demonstrate the robustness and effectiveness of CAGNet for RGB-T crowd counting. Furthermore, the proposed CAGNet can be extended to crowd density estimation and has achieved high performance on the ShanghaiTechRGBD and DroneRGBT datasets. The former dataset comprises paired RGB images and depth maps. The code and model for this research are available at https://github.com/WBangG/CAGNet.
first_indexed 2024-10-01T04:24:05Z
format Journal Article
id ntu-10356/180135
institution Nanyang Technological University
language English
last_indexed 2024-10-01T04:24:05Z
publishDate 2024
record_format dspace
spelling ntu-10356/1801352024-09-18T06:13:54Z CAGNet: coordinated attention guidance network for RGB-T crowd counting Yang, Xun Zhou, Wujie Yan, Weiqing Qian, Xiaohong School of Computer Science and Engineering Computer and Information Science Urban planning Intelligent transportation Estimating crowd density is a demanding task that has garnered significant research attention in urban planning, intelligent transportation, and other related fields. This study utilizes RGB and thermal images to leverage multimodal information and introduces a coordinated attention guidance network (CAGNet) for RGB-thermal (RGB-T) crowd counting. The framework enhances the expressive capabilities of backbone features by incorporating overall and local relationships through the information enhancement unit module, which utilizes the context coordination perception module for horizontal information mining, interactively compensating for the continuity of spatial information. Subsequently, it utilizes diverse multi-level information features for hierarchical intersection and progression, resulting in an accurate crowd counting density map. Experimental results on the RGBT-CC benchmark dataset demonstrate the robustness and effectiveness of CAGNet for RGB-T crowd counting. Furthermore, the proposed CAGNet can be extended to crowd density estimation and has achieved high performance on the ShanghaiTechRGBD and DroneRGBT datasets. The former dataset comprises paired RGB images and depth maps. The code and model for this research are available at https://github.com/WBangG/CAGNet. This work was supported by the National Natural Science Foundation of China (61502429); and the Zhejiang Provincial Natural Science Foundation of China (LY18F020012). 2024-09-18T06:13:54Z 2024-09-18T06:13:54Z 2024 Journal Article Yang, X., Zhou, W., Yan, W. & Qian, X. (2024). CAGNet: coordinated attention guidance network for RGB-T crowd counting. Expert Systems With Applications, 243, 122753-. https://dx.doi.org/10.1016/j.eswa.2023.122753 0957-4174 https://hdl.handle.net/10356/180135 10.1016/j.eswa.2023.122753 2-s2.0-85183588558 243 122753 en Expert Systems with Applications © 2023 Elsevier Ltd. All rights reserved.
spellingShingle Computer and Information Science
Urban planning
Intelligent transportation
Yang, Xun
Zhou, Wujie
Yan, Weiqing
Qian, Xiaohong
CAGNet: coordinated attention guidance network for RGB-T crowd counting
title CAGNet: coordinated attention guidance network for RGB-T crowd counting
title_full CAGNet: coordinated attention guidance network for RGB-T crowd counting
title_fullStr CAGNet: coordinated attention guidance network for RGB-T crowd counting
title_full_unstemmed CAGNet: coordinated attention guidance network for RGB-T crowd counting
title_short CAGNet: coordinated attention guidance network for RGB-T crowd counting
title_sort cagnet coordinated attention guidance network for rgb t crowd counting
topic Computer and Information Science
Urban planning
Intelligent transportation
url https://hdl.handle.net/10356/180135
work_keys_str_mv AT yangxun cagnetcoordinatedattentionguidancenetworkforrgbtcrowdcounting
AT zhouwujie cagnetcoordinatedattentionguidancenetworkforrgbtcrowdcounting
AT yanweiqing cagnetcoordinatedattentionguidancenetworkforrgbtcrowdcounting
AT qianxiaohong cagnetcoordinatedattentionguidancenetworkforrgbtcrowdcounting