Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification

In person re-identification (Re-ID), increasing the diversity of pedestrian features can improve recognition accuracy. In standard convolutional neural networks (CNNs), the receptive fields of neurons in each layer are designed to have the same size. Therefore, in complex pedestrian re-identificatio...

Full description

Bibliographic Details
Main Authors:	Li Xu, Xiang Fu
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Access
Subjects:	Feature correlation multiscale reference-aided multilayer feature fusion person re-identification
Online Access:	https://ieeexplore.ieee.org/document/9568915/

_version_	1818365739316281344
author	Li Xu Xiang Fu
author_facet	Li Xu Xiang Fu
author_sort	Li Xu
collection	DOAJ
description	In person re-identification (Re-ID), increasing the diversity of pedestrian features can improve recognition accuracy. In standard convolutional neural networks (CNNs), the receptive fields of neurons in each layer are designed to have the same size. Therefore, in complex pedestrian re-identification tasks, the standard CNNs extract local features but are unable to obtain satisfactory results for global features extracted from the images. Local feature learning methods are helpful for obtaining more abundant features, which focus on the most significant local features and ignore the correlations between features of various parts of the human body. To solve the above problems, a new multiscale reference-aided attentive feature aggregation (MS-RAFA) mechanism is proposed, consisting of three main modules. First, to extract the most significant local features and strengthen the correlations between the features of various parts of the human body, an autoselect module (ASM) is designed, an attentional mechanism that can stack the structural information and spatial relations to form new features. Then, to realize multiscale feature fusion of the multiple output branches of the backbone network and increase feature diversity, we propose a multilayer feature fusion module (MFFM), which enables the model to mine the features hidden by salient features and to learn features better. Finally, to supervise the MFFM and make the network obtain better recognition features, we propose a multiple supervision mechanism. Finally, experimental results demonstrate that our proposed method outperforms the state-of-the-art methods on three large-scale datasets.
first_indexed	2024-12-13T22:25:03Z
format	Article
id	doaj.art-e8e4a3a54d8140ab8665a62dd3cf5a27
institution	Directory Open Access Journal
issn	2169-3536
language	English
last_indexed	2024-12-13T22:25:03Z
publishDate	2021-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj.art-e8e4a3a54d8140ab8665a62dd3cf5a272022-12-21T23:29:15ZengIEEEIEEE Access2169-35362021-01-01914166714167710.1109/ACCESS.2021.31195769568915Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-IdentificationLi Xu0https://orcid.org/0000-0001-9793-8326Xiang Fu1https://orcid.org/0000-0002-4337-6206Key Laboratory of Jiangxi Province for Image Processing and Pattern Recognition, Nanchang Hangkong University, Nanchang, ChinaKey Laboratory of Jiangxi Province for Image Processing and Pattern Recognition, Nanchang Hangkong University, Nanchang, ChinaIn person re-identification (Re-ID), increasing the diversity of pedestrian features can improve recognition accuracy. In standard convolutional neural networks (CNNs), the receptive fields of neurons in each layer are designed to have the same size. Therefore, in complex pedestrian re-identification tasks, the standard CNNs extract local features but are unable to obtain satisfactory results for global features extracted from the images. Local feature learning methods are helpful for obtaining more abundant features, which focus on the most significant local features and ignore the correlations between features of various parts of the human body. To solve the above problems, a new multiscale reference-aided attentive feature aggregation (MS-RAFA) mechanism is proposed, consisting of three main modules. First, to extract the most significant local features and strengthen the correlations between the features of various parts of the human body, an autoselect module (ASM) is designed, an attentional mechanism that can stack the structural information and spatial relations to form new features. Then, to realize multiscale feature fusion of the multiple output branches of the backbone network and increase feature diversity, we propose a multilayer feature fusion module (MFFM), which enables the model to mine the features hidden by salient features and to learn features better. Finally, to supervise the MFFM and make the network obtain better recognition features, we propose a multiple supervision mechanism. Finally, experimental results demonstrate that our proposed method outperforms the state-of-the-art methods on three large-scale datasets.https://ieeexplore.ieee.org/document/9568915/Feature correlationmultiscale reference-aidedmultilayer feature fusionperson re-identification
spellingShingle	Li Xu Xiang Fu Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification IEEE Access Feature correlation multiscale reference-aided multilayer feature fusion person re-identification
title	Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification
title_full	Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification
title_fullStr	Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification
title_full_unstemmed	Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification
title_short	Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification
title_sort	multiscale reference aided attentive feature aggregation for person re identification
topic	Feature correlation multiscale reference-aided multilayer feature fusion person re-identification
url	https://ieeexplore.ieee.org/document/9568915/
work_keys_str_mv	AT lixu multiscalereferenceaidedattentivefeatureaggregationforpersonreidentification AT xiangfu multiscalereferenceaidedattentivefeatureaggregationforpersonreidentification

Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification

Similar Items