Multi‐granularity re‐ranking for visible‐infrared person re‐identification

Abstract Visible‐infrared person re‐identification (VI‐ReID) is a supplementary task of single‐modality re‐identification, which makes up for the defect of conventional re‐identification under insufficient illumination. It is more challenging than single‐modality ReID because, in addition to difficu...

Full description

Bibliographic Details
Main Authors: Yadi Wang, Hongyun Zhang, Duoqian Miao, Witold Pedrycz
Format: Article
Language:English
Published: Wiley 2023-09-01
Series:CAAI Transactions on Intelligence Technology
Subjects:
Online Access:https://doi.org/10.1049/cit2.12182
_version_ 1797682962754437120
author Yadi Wang
Hongyun Zhang
Duoqian Miao
Witold Pedrycz
author_facet Yadi Wang
Hongyun Zhang
Duoqian Miao
Witold Pedrycz
author_sort Yadi Wang
collection DOAJ
description Abstract Visible‐infrared person re‐identification (VI‐ReID) is a supplementary task of single‐modality re‐identification, which makes up for the defect of conventional re‐identification under insufficient illumination. It is more challenging than single‐modality ReID because, in addition to difficulties in pedestrian posture, camera shooting angle and background change, there are also difficulties in the cross‐modality gap. Existing works only involve coarse‐grained global features in the re‐ranking calculation, which cannot effectively use fine‐grained features. However, fine‐grained features are particularly important due to the lack of information in cross‐modality re‐ID. To this end, the Q‐center Multi‐granularity K‐reciprocal Re‐ranking Algorithm (termed QCMR) is proposed, including a Q‐nearest neighbour centre encoder (termed QNC) and a Multi‐granularity K‐reciprocal Encoder (termed MGK) for a more comprehensive feature representation. QNC converts the probe‐corresponding modality features into gallery corresponding modality features through modality transfer to narrow the modality gap. MGK takes a coarse‐grained mutual nearest neighbour as the dominant and combines a fine‐grained nearest neighbour as a supplement for similarity measurement. Extensive experiments on two widely used VI‐ReID benchmarks, SYSU‐MM01 and RegDB have shown that our method achieves state‐of‐the‐art results. Especially, the mAP of SYSU‐MM01 is increased by 5.9% in all‐search mode.
first_indexed 2024-03-12T00:07:30Z
format Article
id doaj.art-c731fc73df0e4284bca5b1dbf1f25b9f
institution Directory Open Access Journal
issn 2468-2322
language English
last_indexed 2024-03-12T00:07:30Z
publishDate 2023-09-01
publisher Wiley
record_format Article
series CAAI Transactions on Intelligence Technology
spelling doaj.art-c731fc73df0e4284bca5b1dbf1f25b9f2023-09-16T16:19:34ZengWileyCAAI Transactions on Intelligence Technology2468-23222023-09-018377077910.1049/cit2.12182Multi‐granularity re‐ranking for visible‐infrared person re‐identificationYadi Wang0Hongyun Zhang1Duoqian Miao2Witold Pedrycz3Department of Computer Science and Technology Tongji University Shanghai ChinaDepartment of Computer Science and Technology Tongji University Shanghai ChinaDepartment of Computer Science and Technology Tongji University Shanghai ChinaDepartment of Electrical and Computer Engineering University of Alberta Edmonton Alberta CanadaAbstract Visible‐infrared person re‐identification (VI‐ReID) is a supplementary task of single‐modality re‐identification, which makes up for the defect of conventional re‐identification under insufficient illumination. It is more challenging than single‐modality ReID because, in addition to difficulties in pedestrian posture, camera shooting angle and background change, there are also difficulties in the cross‐modality gap. Existing works only involve coarse‐grained global features in the re‐ranking calculation, which cannot effectively use fine‐grained features. However, fine‐grained features are particularly important due to the lack of information in cross‐modality re‐ID. To this end, the Q‐center Multi‐granularity K‐reciprocal Re‐ranking Algorithm (termed QCMR) is proposed, including a Q‐nearest neighbour centre encoder (termed QNC) and a Multi‐granularity K‐reciprocal Encoder (termed MGK) for a more comprehensive feature representation. QNC converts the probe‐corresponding modality features into gallery corresponding modality features through modality transfer to narrow the modality gap. MGK takes a coarse‐grained mutual nearest neighbour as the dominant and combines a fine‐grained nearest neighbour as a supplement for similarity measurement. Extensive experiments on two widely used VI‐ReID benchmarks, SYSU‐MM01 and RegDB have shown that our method achieves state‐of‐the‐art results. Especially, the mAP of SYSU‐MM01 is increased by 5.9% in all‐search mode.https://doi.org/10.1049/cit2.12182computer visionrecognition
spellingShingle Yadi Wang
Hongyun Zhang
Duoqian Miao
Witold Pedrycz
Multi‐granularity re‐ranking for visible‐infrared person re‐identification
CAAI Transactions on Intelligence Technology
computer vision
recognition
title Multi‐granularity re‐ranking for visible‐infrared person re‐identification
title_full Multi‐granularity re‐ranking for visible‐infrared person re‐identification
title_fullStr Multi‐granularity re‐ranking for visible‐infrared person re‐identification
title_full_unstemmed Multi‐granularity re‐ranking for visible‐infrared person re‐identification
title_short Multi‐granularity re‐ranking for visible‐infrared person re‐identification
title_sort multi granularity re ranking for visible infrared person re identification
topic computer vision
recognition
url https://doi.org/10.1049/cit2.12182
work_keys_str_mv AT yadiwang multigranularityrerankingforvisibleinfraredpersonreidentification
AT hongyunzhang multigranularityrerankingforvisibleinfraredpersonreidentification
AT duoqianmiao multigranularityrerankingforvisibleinfraredpersonreidentification
AT witoldpedrycz multigranularityrerankingforvisibleinfraredpersonreidentification