Constructing Adaptive Multi-Scale Feature via Transformer-Aware Patch for Occluded Person Re-Identification

Person re-identification (Re-ID) aims to retrieve a specific pedestrian across a multi-disjoint camera in a surveillance system. Most of the research is based on a strong assumption that images should contain a full human torso. However, it cannot be guaranteed that all the people have a clear foreg...

Full description

Bibliographic Details
Main Authors: Zhi Liu, Xingyu Mu, Shidu Dong, Yunhua Lu, Mingzi Jiang
Format: Article
Language:English
Published: MDPI AG 2022-07-01
Series:Symmetry
Subjects:
Online Access:https://www.mdpi.com/2073-8994/14/7/1454
_version_ 1797415376060940288
author Zhi Liu
Xingyu Mu
Shidu Dong
Yunhua Lu
Mingzi Jiang
author_facet Zhi Liu
Xingyu Mu
Shidu Dong
Yunhua Lu
Mingzi Jiang
author_sort Zhi Liu
collection DOAJ
description Person re-identification (Re-ID) aims to retrieve a specific pedestrian across a multi-disjoint camera in a surveillance system. Most of the research is based on a strong assumption that images should contain a full human torso. However, it cannot be guaranteed that all the people have a clear foreground because they are out of constraint. In the real world, a variety of occluded situations frequently appear in video monitoring, which impedes the recognition process. To settle the occluded person Re-ID issue, a new Dual-Transformer symmetric architecture is proposed in this work, which can reduce the occluded impact and build a multi-scale feature. There are two contributions to our proposed model. (i) A Transformer-Aware Patch Searching (TAPS) module is devised to learn visible human region distribution using a multiheaded self-attention mechanism and construct a branch of distributed information attention scale. (ii) An Adaptive Visible-Part Cropping (AVPC) Strategy, with two steps of cropping and weakly-supervised learning, is used to generate a fine-scale visible image for another branch. Only ID labels are utilized to restrain TAPS and AVPC without any extra visible-part annotation. Extensive experiments are conducted on two occluded person Re-ID benchmarks, confirming that our approach performs a SOTA or comparable effect.
first_indexed 2024-03-09T05:47:45Z
format Article
id doaj.art-fcf380ba7888438d907080833b893618
institution Directory Open Access Journal
issn 2073-8994
language English
last_indexed 2024-03-09T05:47:45Z
publishDate 2022-07-01
publisher MDPI AG
record_format Article
series Symmetry
spelling doaj.art-fcf380ba7888438d907080833b8936182023-12-03T12:20:10ZengMDPI AGSymmetry2073-89942022-07-01147145410.3390/sym14071454Constructing Adaptive Multi-Scale Feature via Transformer-Aware Patch for Occluded Person Re-IdentificationZhi Liu0Xingyu Mu1Shidu Dong2Yunhua Lu3Mingzi Jiang4School of Artificial Intelligence, Chongqing University of Technology, Chongqing 401120, ChinaSchool of Artificial Intelligence, Chongqing University of Technology, Chongqing 401120, ChinaSchool of Artificial Intelligence, Chongqing University of Technology, Chongqing 401120, ChinaSchool of Artificial Intelligence, Chongqing University of Technology, Chongqing 401120, ChinaSchool of Artificial Intelligence, Chongqing University of Technology, Chongqing 401120, ChinaPerson re-identification (Re-ID) aims to retrieve a specific pedestrian across a multi-disjoint camera in a surveillance system. Most of the research is based on a strong assumption that images should contain a full human torso. However, it cannot be guaranteed that all the people have a clear foreground because they are out of constraint. In the real world, a variety of occluded situations frequently appear in video monitoring, which impedes the recognition process. To settle the occluded person Re-ID issue, a new Dual-Transformer symmetric architecture is proposed in this work, which can reduce the occluded impact and build a multi-scale feature. There are two contributions to our proposed model. (i) A Transformer-Aware Patch Searching (TAPS) module is devised to learn visible human region distribution using a multiheaded self-attention mechanism and construct a branch of distributed information attention scale. (ii) An Adaptive Visible-Part Cropping (AVPC) Strategy, with two steps of cropping and weakly-supervised learning, is used to generate a fine-scale visible image for another branch. Only ID labels are utilized to restrain TAPS and AVPC without any extra visible-part annotation. Extensive experiments are conducted on two occluded person Re-ID benchmarks, confirming that our approach performs a SOTA or comparable effect.https://www.mdpi.com/2073-8994/14/7/1454person re-identificationmultiheaded self-attentionweakly-supervised learningmulti-scale feature
spellingShingle Zhi Liu
Xingyu Mu
Shidu Dong
Yunhua Lu
Mingzi Jiang
Constructing Adaptive Multi-Scale Feature via Transformer-Aware Patch for Occluded Person Re-Identification
Symmetry
person re-identification
multiheaded self-attention
weakly-supervised learning
multi-scale feature
title Constructing Adaptive Multi-Scale Feature via Transformer-Aware Patch for Occluded Person Re-Identification
title_full Constructing Adaptive Multi-Scale Feature via Transformer-Aware Patch for Occluded Person Re-Identification
title_fullStr Constructing Adaptive Multi-Scale Feature via Transformer-Aware Patch for Occluded Person Re-Identification
title_full_unstemmed Constructing Adaptive Multi-Scale Feature via Transformer-Aware Patch for Occluded Person Re-Identification
title_short Constructing Adaptive Multi-Scale Feature via Transformer-Aware Patch for Occluded Person Re-Identification
title_sort constructing adaptive multi scale feature via transformer aware patch for occluded person re identification
topic person re-identification
multiheaded self-attention
weakly-supervised learning
multi-scale feature
url https://www.mdpi.com/2073-8994/14/7/1454
work_keys_str_mv AT zhiliu constructingadaptivemultiscalefeatureviatransformerawarepatchforoccludedpersonreidentification
AT xingyumu constructingadaptivemultiscalefeatureviatransformerawarepatchforoccludedpersonreidentification
AT shidudong constructingadaptivemultiscalefeatureviatransformerawarepatchforoccludedpersonreidentification
AT yunhualu constructingadaptivemultiscalefeatureviatransformerawarepatchforoccludedpersonreidentification
AT mingzijiang constructingadaptivemultiscalefeatureviatransformerawarepatchforoccludedpersonreidentification