Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification
In person re-identification (Re-ID), increasing the diversity of pedestrian features can improve recognition accuracy. In standard convolutional neural networks (CNNs), the receptive fields of neurons in each layer are designed to have the same size. Therefore, in complex pedestrian re-identificatio...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9568915/ |
_version_ | 1818365739316281344 |
---|---|
author | Li Xu Xiang Fu |
author_facet | Li Xu Xiang Fu |
author_sort | Li Xu |
collection | DOAJ |
description | In person re-identification (Re-ID), increasing the diversity of pedestrian features can improve recognition accuracy. In standard convolutional neural networks (CNNs), the receptive fields of neurons in each layer are designed to have the same size. Therefore, in complex pedestrian re-identification tasks, the standard CNNs extract local features but are unable to obtain satisfactory results for global features extracted from the images. Local feature learning methods are helpful for obtaining more abundant features, which focus on the most significant local features and ignore the correlations between features of various parts of the human body. To solve the above problems, a new multiscale reference-aided attentive feature aggregation (MS-RAFA) mechanism is proposed, consisting of three main modules. First, to extract the most significant local features and strengthen the correlations between the features of various parts of the human body, an autoselect module (ASM) is designed, an attentional mechanism that can stack the structural information and spatial relations to form new features. Then, to realize multiscale feature fusion of the multiple output branches of the backbone network and increase feature diversity, we propose a multilayer feature fusion module (MFFM), which enables the model to mine the features hidden by salient features and to learn features better. Finally, to supervise the MFFM and make the network obtain better recognition features, we propose a multiple supervision mechanism. Finally, experimental results demonstrate that our proposed method outperforms the state-of-the-art methods on three large-scale datasets. |
first_indexed | 2024-12-13T22:25:03Z |
format | Article |
id | doaj.art-e8e4a3a54d8140ab8665a62dd3cf5a27 |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-12-13T22:25:03Z |
publishDate | 2021-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-e8e4a3a54d8140ab8665a62dd3cf5a272022-12-21T23:29:15ZengIEEEIEEE Access2169-35362021-01-01914166714167710.1109/ACCESS.2021.31195769568915Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-IdentificationLi Xu0https://orcid.org/0000-0001-9793-8326Xiang Fu1https://orcid.org/0000-0002-4337-6206Key Laboratory of Jiangxi Province for Image Processing and Pattern Recognition, Nanchang Hangkong University, Nanchang, ChinaKey Laboratory of Jiangxi Province for Image Processing and Pattern Recognition, Nanchang Hangkong University, Nanchang, ChinaIn person re-identification (Re-ID), increasing the diversity of pedestrian features can improve recognition accuracy. In standard convolutional neural networks (CNNs), the receptive fields of neurons in each layer are designed to have the same size. Therefore, in complex pedestrian re-identification tasks, the standard CNNs extract local features but are unable to obtain satisfactory results for global features extracted from the images. Local feature learning methods are helpful for obtaining more abundant features, which focus on the most significant local features and ignore the correlations between features of various parts of the human body. To solve the above problems, a new multiscale reference-aided attentive feature aggregation (MS-RAFA) mechanism is proposed, consisting of three main modules. First, to extract the most significant local features and strengthen the correlations between the features of various parts of the human body, an autoselect module (ASM) is designed, an attentional mechanism that can stack the structural information and spatial relations to form new features. Then, to realize multiscale feature fusion of the multiple output branches of the backbone network and increase feature diversity, we propose a multilayer feature fusion module (MFFM), which enables the model to mine the features hidden by salient features and to learn features better. Finally, to supervise the MFFM and make the network obtain better recognition features, we propose a multiple supervision mechanism. Finally, experimental results demonstrate that our proposed method outperforms the state-of-the-art methods on three large-scale datasets.https://ieeexplore.ieee.org/document/9568915/Feature correlationmultiscale reference-aidedmultilayer feature fusionperson re-identification |
spellingShingle | Li Xu Xiang Fu Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification IEEE Access Feature correlation multiscale reference-aided multilayer feature fusion person re-identification |
title | Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification |
title_full | Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification |
title_fullStr | Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification |
title_full_unstemmed | Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification |
title_short | Multiscale Reference-Aided Attentive Feature Aggregation for Person Re-Identification |
title_sort | multiscale reference aided attentive feature aggregation for person re identification |
topic | Feature correlation multiscale reference-aided multilayer feature fusion person re-identification |
url | https://ieeexplore.ieee.org/document/9568915/ |
work_keys_str_mv | AT lixu multiscalereferenceaidedattentivefeatureaggregationforpersonreidentification AT xiangfu multiscalereferenceaidedattentivefeatureaggregationforpersonreidentification |