A Universal Decoupled Training Framework for Human Parsing

Human parsing is an important technology in human–robot interaction systems. At present, the distribution of multi-category human parsing datasets is unbalanced, and the samples present a long-tailed distribution, which directly affects the performance of human parsing. Meanwhile, the similarity bet...

Full description

Bibliographic Details
Main Authors: Yang Li, Huahong Zuo, Ping Han
Format: Article
Language:English
Published: MDPI AG 2022-08-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/22/16/5964
_version_ 1827617560712970240
author Yang Li
Huahong Zuo
Ping Han
author_facet Yang Li
Huahong Zuo
Ping Han
author_sort Yang Li
collection DOAJ
description Human parsing is an important technology in human–robot interaction systems. At present, the distribution of multi-category human parsing datasets is unbalanced, and the samples present a long-tailed distribution, which directly affects the performance of human parsing. Meanwhile, the similarity between different categories leads the model to predict false parsing results. To solve the above problems, a general decoupled training framework called Decoupled Training framework based on Pixel Resampling (DTPR) was proposed to solve the long-tailed distribution, and a new sampling method named Pixel Resampling based on Accuracy distribution (PRA) for semantic segmentation was also proposed and applied to this decoupled training framework. The framework divides the training process into two phases, the first phase is to improve the model feature extraction ability, and the second phase is to improve the performance of the model on tail categories. The training framework was evaluated in MHPv2.0 and LIP datasets, and tested in both high-precision and real-time SOTA models. The MPA metric of model trained by DTPR in above two datasets increased by more than 6%, and the mIoU metric increased by more than 1% without changing the model structure.
first_indexed 2024-03-09T09:49:50Z
format Article
id doaj.art-404f342fac8a416b9fed16ee83f6d423
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-09T09:49:50Z
publishDate 2022-08-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-404f342fac8a416b9fed16ee83f6d4232023-12-02T00:16:17ZengMDPI AGSensors1424-82202022-08-012216596410.3390/s22165964A Universal Decoupled Training Framework for Human ParsingYang Li0Huahong Zuo1Ping Han2School of Information Engineering, Wuhan University of Technology, Wuhan 430070, ChinaWuhan Chuyan Information Technology Co., Ltd., Wuhan 430030, ChinaSchool of Information Engineering, Wuhan University of Technology, Wuhan 430070, ChinaHuman parsing is an important technology in human–robot interaction systems. At present, the distribution of multi-category human parsing datasets is unbalanced, and the samples present a long-tailed distribution, which directly affects the performance of human parsing. Meanwhile, the similarity between different categories leads the model to predict false parsing results. To solve the above problems, a general decoupled training framework called Decoupled Training framework based on Pixel Resampling (DTPR) was proposed to solve the long-tailed distribution, and a new sampling method named Pixel Resampling based on Accuracy distribution (PRA) for semantic segmentation was also proposed and applied to this decoupled training framework. The framework divides the training process into two phases, the first phase is to improve the model feature extraction ability, and the second phase is to improve the performance of the model on tail categories. The training framework was evaluated in MHPv2.0 and LIP datasets, and tested in both high-precision and real-time SOTA models. The MPA metric of model trained by DTPR in above two datasets increased by more than 6%, and the mIoU metric increased by more than 1% without changing the model structure.https://www.mdpi.com/1424-8220/22/16/5964pixel resamplinglong-tailed distributionhuman parsingsemantic segmentation
spellingShingle Yang Li
Huahong Zuo
Ping Han
A Universal Decoupled Training Framework for Human Parsing
Sensors
pixel resampling
long-tailed distribution
human parsing
semantic segmentation
title A Universal Decoupled Training Framework for Human Parsing
title_full A Universal Decoupled Training Framework for Human Parsing
title_fullStr A Universal Decoupled Training Framework for Human Parsing
title_full_unstemmed A Universal Decoupled Training Framework for Human Parsing
title_short A Universal Decoupled Training Framework for Human Parsing
title_sort universal decoupled training framework for human parsing
topic pixel resampling
long-tailed distribution
human parsing
semantic segmentation
url https://www.mdpi.com/1424-8220/22/16/5964
work_keys_str_mv AT yangli auniversaldecoupledtrainingframeworkforhumanparsing
AT huahongzuo auniversaldecoupledtrainingframeworkforhumanparsing
AT pinghan auniversaldecoupledtrainingframeworkforhumanparsing
AT yangli universaldecoupledtrainingframeworkforhumanparsing
AT huahongzuo universaldecoupledtrainingframeworkforhumanparsing
AT pinghan universaldecoupledtrainingframeworkforhumanparsing