Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification

In person re-identification (Re-ID), increasing the diversity of pedestrian features can improve recognition accuracy. In standard convolutional neural networks (CNNs), the receptive fields of neurons in each layer are designed to have the same size. Therefore, in complex pedestrian re-identificatio...

Full description

Bibliographic Details
Main Authors: Li Xu, Xiang Fu
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9568915/
_version_ 1818365739316281344
author Li Xu
Xiang Fu
author_facet Li Xu
Xiang Fu
author_sort Li Xu
collection DOAJ
description In person re-identification (Re-ID), increasing the diversity of pedestrian features can improve recognition accuracy. In standard convolutional neural networks (CNNs), the receptive fields of neurons in each layer are designed to have the same size. Therefore, in complex pedestrian re-identification tasks, the standard CNNs extract local features but are unable to obtain satisfactory results for global features extracted from the images. Local feature learning methods are helpful for obtaining more abundant features, which focus on the most significant local features and ignore the correlations between features of various parts of the human body. To solve the above problems, a new multiscale reference-aided attentive feature aggregation (MS-RAFA) mechanism is proposed, consisting of three main modules. First, to extract the most significant local features and strengthen the correlations between the features of various parts of the human body, an autoselect module (ASM) is designed, an attentional mechanism that can stack the structural information and spatial relations to form new features. Then, to realize multiscale feature fusion of the multiple output branches of the backbone network and increase feature diversity, we propose a multilayer feature fusion module (MFFM), which enables the model to mine the features hidden by salient features and to learn features better. Finally, to supervise the MFFM and make the network obtain better recognition features, we propose a multiple supervision mechanism. Finally, experimental results demonstrate that our proposed method outperforms the state-of-the-art methods on three large-scale datasets.
first_indexed 2024-12-13T22:25:03Z
format Article
id doaj.art-e8e4a3a54d8140ab8665a62dd3cf5a27
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-13T22:25:03Z
publishDate 2021-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-e8e4a3a54d8140ab8665a62dd3cf5a272022-12-21T23:29:15ZengIEEEIEEE Access2169-35362021-01-01914166714167710.1109/ACCESS.2021.31195769568915Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-IdentificationLi Xu0https://orcid.org/0000-0001-9793-8326Xiang Fu1https://orcid.org/0000-0002-4337-6206Key Laboratory of Jiangxi Province for Image Processing and Pattern Recognition, Nanchang Hangkong University, Nanchang, ChinaKey Laboratory of Jiangxi Province for Image Processing and Pattern Recognition, Nanchang Hangkong University, Nanchang, ChinaIn person re-identification (Re-ID), increasing the diversity of pedestrian features can improve recognition accuracy. In standard convolutional neural networks (CNNs), the receptive fields of neurons in each layer are designed to have the same size. Therefore, in complex pedestrian re-identification tasks, the standard CNNs extract local features but are unable to obtain satisfactory results for global features extracted from the images. Local feature learning methods are helpful for obtaining more abundant features, which focus on the most significant local features and ignore the correlations between features of various parts of the human body. To solve the above problems, a new multiscale reference-aided attentive feature aggregation (MS-RAFA) mechanism is proposed, consisting of three main modules. First, to extract the most significant local features and strengthen the correlations between the features of various parts of the human body, an autoselect module (ASM) is designed, an attentional mechanism that can stack the structural information and spatial relations to form new features. Then, to realize multiscale feature fusion of the multiple output branches of the backbone network and increase feature diversity, we propose a multilayer feature fusion module (MFFM), which enables the model to mine the features hidden by salient features and to learn features better. Finally, to supervise the MFFM and make the network obtain better recognition features, we propose a multiple supervision mechanism. Finally, experimental results demonstrate that our proposed method outperforms the state-of-the-art methods on three large-scale datasets.https://ieeexplore.ieee.org/document/9568915/Feature correlationmultiscale reference-aidedmultilayer feature fusionperson re-identification
spellingShingle Li Xu
Xiang Fu
Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification
IEEE Access
Feature correlation
multiscale reference-aided
multilayer feature fusion
person re-identification
title Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification
title_full Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification
title_fullStr Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification
title_full_unstemmed Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification
title_short Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification
title_sort multiscale reference aided attentive feature aggregation for person re identification
topic Feature correlation
multiscale reference-aided
multilayer feature fusion
person re-identification
url https://ieeexplore.ieee.org/document/9568915/
work_keys_str_mv AT lixu multiscalereferenceaidedattentivefeatureaggregationforpersonreidentification
AT xiangfu multiscalereferenceaidedattentivefeatureaggregationforpersonreidentification