Person Re-Identification via Pyramid Multipart Features and Multi-Attention Framework

Video-based person re-identification has become quite attractive due to its importance in many vision surveillance problems. It is a challenging topic due to the inter/intra changes, occlusion, and pose variations involved. In this paper, we propose a pyramid-attentive framework that relies on multi...

Full description

Bibliographic Details
Main Authors: Randa Mohamed Bayoumi, Elsayed E. Hemayed, Mohammad Ehab Ragab, Magda B. Fayek
Format: Article
Language:English
Published: MDPI AG 2022-02-01
Series:Big Data and Cognitive Computing
Subjects:
Online Access:https://www.mdpi.com/2504-2289/6/1/20
_version_ 1797472832728334336
author Randa Mohamed Bayoumi
Elsayed E. Hemayed
Mohammad Ehab Ragab
Magda B. Fayek
author_facet Randa Mohamed Bayoumi
Elsayed E. Hemayed
Mohammad Ehab Ragab
Magda B. Fayek
author_sort Randa Mohamed Bayoumi
collection DOAJ
description Video-based person re-identification has become quite attractive due to its importance in many vision surveillance problems. It is a challenging topic due to the inter/intra changes, occlusion, and pose variations involved. In this paper, we propose a pyramid-attentive framework that relies on multi-part features and multiple attention to aggregate features of multi-levels and learns attention-based representations of persons through various aspects. Self-attention is used to strengthen the most discriminative features in the spatial and channel domains and hence capture robust global information. We propose the use of part-relation attention between different multi-granularities of features’ representation to focus on learning appropriate local features. Temporal attention is used to aggregate temporal features. We integrate the most robust features in the global and multi-level views to build an effective convolution neural network (CNN) model. The proposed model outperforms the previous state-of-the art models on three datasets. Notably, using the proposed model enables the achievement of 98.9% (a relative improvement of 2.7% on the GRL) top1 accuracy and 99.3% mAP on the PRID2011, and 92.8% (a relative improvement of 2.4% relative to GRL) top1 accuracy on iLIDS-vid. We also explore the generalization ability of our model on a cross dataset.
first_indexed 2024-03-09T20:06:44Z
format Article
id doaj.art-649524bc3e5b4398808aa2a3aee604b6
institution Directory Open Access Journal
issn 2504-2289
language English
last_indexed 2024-03-09T20:06:44Z
publishDate 2022-02-01
publisher MDPI AG
record_format Article
series Big Data and Cognitive Computing
spelling doaj.art-649524bc3e5b4398808aa2a3aee604b62023-11-24T00:29:04ZengMDPI AGBig Data and Cognitive Computing2504-22892022-02-01612010.3390/bdcc6010020Person Re-Identification via Pyramid Multipart Features and Multi-Attention FrameworkRanda Mohamed Bayoumi0Elsayed E. Hemayed1Mohammad Ehab Ragab2Magda B. Fayek3Informatics Research Department, Electronics Research Institute, Giza 12622, EgyptComputer Engineering Department, Faculty of Engineering, Cairo University, Giza 12613, EgyptInformatics Research Department, Electronics Research Institute, Giza 12622, EgyptComputer Engineering Department, Faculty of Engineering, Cairo University, Giza 12613, EgyptVideo-based person re-identification has become quite attractive due to its importance in many vision surveillance problems. It is a challenging topic due to the inter/intra changes, occlusion, and pose variations involved. In this paper, we propose a pyramid-attentive framework that relies on multi-part features and multiple attention to aggregate features of multi-levels and learns attention-based representations of persons through various aspects. Self-attention is used to strengthen the most discriminative features in the spatial and channel domains and hence capture robust global information. We propose the use of part-relation attention between different multi-granularities of features’ representation to focus on learning appropriate local features. Temporal attention is used to aggregate temporal features. We integrate the most robust features in the global and multi-level views to build an effective convolution neural network (CNN) model. The proposed model outperforms the previous state-of-the art models on three datasets. Notably, using the proposed model enables the achievement of 98.9% (a relative improvement of 2.7% on the GRL) top1 accuracy and 99.3% mAP on the PRID2011, and 92.8% (a relative improvement of 2.4% relative to GRL) top1 accuracy on iLIDS-vid. We also explore the generalization ability of our model on a cross dataset.https://www.mdpi.com/2504-2289/6/1/20computer visiondeep learningperson re-identificationattentiontemporal aggregationmulti-granularities
spellingShingle Randa Mohamed Bayoumi
Elsayed E. Hemayed
Mohammad Ehab Ragab
Magda B. Fayek
Person Re-Identification via Pyramid Multipart Features and Multi-Attention Framework
Big Data and Cognitive Computing
computer vision
deep learning
person re-identification
attention
temporal aggregation
multi-granularities
title Person Re-Identification via Pyramid Multipart Features and Multi-Attention Framework
title_full Person Re-Identification via Pyramid Multipart Features and Multi-Attention Framework
title_fullStr Person Re-Identification via Pyramid Multipart Features and Multi-Attention Framework
title_full_unstemmed Person Re-Identification via Pyramid Multipart Features and Multi-Attention Framework
title_short Person Re-Identification via Pyramid Multipart Features and Multi-Attention Framework
title_sort person re identification via pyramid multipart features and multi attention framework
topic computer vision
deep learning
person re-identification
attention
temporal aggregation
multi-granularities
url https://www.mdpi.com/2504-2289/6/1/20
work_keys_str_mv AT randamohamedbayoumi personreidentificationviapyramidmultipartfeaturesandmultiattentionframework
AT elsayedehemayed personreidentificationviapyramidmultipartfeaturesandmultiattentionframework
AT mohammadehabragab personreidentificationviapyramidmultipartfeaturesandmultiattentionframework
AT magdabfayek personreidentificationviapyramidmultipartfeaturesandmultiattentionframework