Survey of Cross-Modal Person Re-Identification from a Mathematical Perspective

Person re-identification (Re-ID) aims to retrieve a particular pedestrian’s identification from a surveillance system consisting of non-overlapping cameras. In recent years, researchers have begun to focus on open-world person Re-ID tasks based on non-ideal situations. One of the most representative...

Full description

Bibliographic Details
Main Authors:	Minghui Liu, Yafei Zhang, Huafeng Li
Format:	Article
Language:	English
Published:	MDPI AG 2023-01-01
Series:	Mathematics
Subjects:	cross-modal person re-identification review mathematical perspective
Online Access:	https://www.mdpi.com/2227-7390/11/3/654

_version_	1797623827218300928
author	Minghui Liu Yafei Zhang Huafeng Li
author_facet	Minghui Liu Yafei Zhang Huafeng Li
author_sort	Minghui Liu
collection	DOAJ
description	Person re-identification (Re-ID) aims to retrieve a particular pedestrian’s identification from a surveillance system consisting of non-overlapping cameras. In recent years, researchers have begun to focus on open-world person Re-ID tasks based on non-ideal situations. One of the most representative of these is cross-modal person Re-ID, which aims to match probe data with target data from different modalities. According to the modalities of probe and target data, we divided cross-modal person Re-ID into visible–infrared, visible–depth, visible–sketch, and visible–text person Re-ID. In cross-modal person Re-ID, the most challenging problem is the modal gap. According to the different methods of narrowing the modal gap, we classified the existing works into picture-based style conversion methods, feature-based modality-invariant embedding mapping methods, and modality-unrelated auxiliary information mining methods. In addition, by generalizing the aforementioned works, we find that although deep-learning-based models perform well, the black-box-like learning process makes these models less interpretable and generalized. Therefore, we attempted to interpret different cross-modal person Re-ID models from a mathematical perspective. Through the above work, we attempt to compensate for the lack of mathematical interpretation of models in previous person Re-ID reviews and hope that our work will bring new inspiration to researchers.
first_indexed	2024-03-11T09:34:16Z
format	Article
id	doaj.art-543f0468dcf845e9a9b7421ab92e62d5
institution	Directory Open Access Journal
issn	2227-7390
language	English
last_indexed	2024-03-11T09:34:16Z
publishDate	2023-01-01
publisher	MDPI AG
record_format	Article
series	Mathematics
spelling	doaj.art-543f0468dcf845e9a9b7421ab92e62d52023-11-16T17:22:42ZengMDPI AGMathematics2227-73902023-01-0111365410.3390/math11030654Survey of Cross-Modal Person Re-Identification from a Mathematical PerspectiveMinghui Liu0Yafei Zhang1Huafeng Li2Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, ChinaFaculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, ChinaFaculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, ChinaPerson re-identification (Re-ID) aims to retrieve a particular pedestrian’s identification from a surveillance system consisting of non-overlapping cameras. In recent years, researchers have begun to focus on open-world person Re-ID tasks based on non-ideal situations. One of the most representative of these is cross-modal person Re-ID, which aims to match probe data with target data from different modalities. According to the modalities of probe and target data, we divided cross-modal person Re-ID into visible–infrared, visible–depth, visible–sketch, and visible–text person Re-ID. In cross-modal person Re-ID, the most challenging problem is the modal gap. According to the different methods of narrowing the modal gap, we classified the existing works into picture-based style conversion methods, feature-based modality-invariant embedding mapping methods, and modality-unrelated auxiliary information mining methods. In addition, by generalizing the aforementioned works, we find that although deep-learning-based models perform well, the black-box-like learning process makes these models less interpretable and generalized. Therefore, we attempted to interpret different cross-modal person Re-ID models from a mathematical perspective. Through the above work, we attempt to compensate for the lack of mathematical interpretation of models in previous person Re-ID reviews and hope that our work will bring new inspiration to researchers.https://www.mdpi.com/2227-7390/11/3/654cross-modal person re-identificationreviewmathematical perspective
spellingShingle	Minghui Liu Yafei Zhang Huafeng Li Survey of Cross-Modal Person Re-Identification from a Mathematical Perspective Mathematics cross-modal person re-identification review mathematical perspective
title	Survey of Cross-Modal Person Re-Identification from a Mathematical Perspective
title_full	Survey of Cross-Modal Person Re-Identification from a Mathematical Perspective
title_fullStr	Survey of Cross-Modal Person Re-Identification from a Mathematical Perspective
title_full_unstemmed	Survey of Cross-Modal Person Re-Identification from a Mathematical Perspective
title_short	Survey of Cross-Modal Person Re-Identification from a Mathematical Perspective
title_sort	survey of cross modal person re identification from a mathematical perspective
topic	cross-modal person re-identification review mathematical perspective
url	https://www.mdpi.com/2227-7390/11/3/654
work_keys_str_mv	AT minghuiliu surveyofcrossmodalpersonreidentificationfromamathematicalperspective AT yafeizhang surveyofcrossmodalpersonreidentificationfromamathematicalperspective AT huafengli surveyofcrossmodalpersonreidentificationfromamathematicalperspective

Survey of Cross-Modal Person Re-Identification from a Mathematical Perspective

Similar Items