Deep Learning Methods for 3D Human Pose Estimation under Different Supervision Paradigms: A Survey

The rise of deep learning technology has broadly promoted the practical application of artificial intelligence in production and daily life. In computer vision, many human-centered applications, such as video surveillance, human-computer interaction, digital entertainment, etc., rely heavily on accu...

Full description

Bibliographic Details
Main Authors:	Dejun Zhang, Yiqi Wu, Mingyue Guo, Yilin Chen
Format:	Article
Language:	English
Published:	MDPI AG 2021-09-01
Series:	Electronics
Subjects:	3D human pose estimation deep learning unsupervised semi-supervised fully-supervised weakly-supervised
Online Access:	https://www.mdpi.com/2079-9292/10/18/2267

_version_	1797519588611588096
author	Dejun Zhang Yiqi Wu Mingyue Guo Yilin Chen
author_facet	Dejun Zhang Yiqi Wu Mingyue Guo Yilin Chen
author_sort	Dejun Zhang
collection	DOAJ
description	The rise of deep learning technology has broadly promoted the practical application of artificial intelligence in production and daily life. In computer vision, many human-centered applications, such as video surveillance, human-computer interaction, digital entertainment, etc., rely heavily on accurate and efficient human pose estimation techniques. Inspired by the remarkable achievements in learning-based 2D human pose estimation, numerous research studies are devoted to the topic of 3D human pose estimation via deep learning methods. Against this backdrop, this paper provides an extensive literature survey of recent literature about deep learning methods for 3D human pose estimation to display the development process of these research studies, track the latest research trends, and analyze the characteristics of devised types of methods. The literature is reviewed, along with the general pipeline of 3D human pose estimation, which consists of human body modeling, learning-based pose estimation, and regularization for refinement. Different from existing reviews of the same topic, this paper focus on deep learning-based methods. The learning-based pose estimation is discussed from two categories: single-person and multi-person. Each one is further categorized by data type to the image-based methods and the video-based methods. Moreover, due to the significance of data for learning-based methods, this paper surveys the 3D human pose estimation methods according to the taxonomy of supervision form. At last, this paper also enlists the current and widely used datasets and compares performances of reviewed methods. Based on this literature survey, it can be concluded that each branch of 3D human pose estimation starts with fully-supervised methods, and there is still much room for multi-person pose estimation based on other supervision methods from both image and video. Besides the significant development of 3D human pose estimation via deep learning, the inherent ambiguity and occlusion problems remain challenging issues that need to be better addressed.
first_indexed	2024-03-10T07:44:55Z
format	Article
id	doaj.art-e0b1cc939cad4b54b5e8f8a08d002487
institution	Directory Open Access Journal
issn	2079-9292
language	English
last_indexed	2024-03-10T07:44:55Z
publishDate	2021-09-01
publisher	MDPI AG
record_format	Article
series	Electronics
spelling	doaj.art-e0b1cc939cad4b54b5e8f8a08d0024872023-11-22T12:48:25ZengMDPI AGElectronics2079-92922021-09-011018226710.3390/electronics10182267Deep Learning Methods for 3D Human Pose Estimation under Different Supervision Paradigms: A SurveyDejun Zhang0Yiqi Wu1Mingyue Guo2Yilin Chen3School of Geography and Information Engineering, China University of Geosciences, Wuhan 430078, ChinaCollege of Computer Science, China University of Geosciences, Wuhan 430078, ChinaCollege of Information and Engineering, Sichuan Agricultural University, Yaan 625014, ChinaSchool of Computer Science and Engineering, Wuhan Institute of Technology, Wuhan 430205, ChinaThe rise of deep learning technology has broadly promoted the practical application of artificial intelligence in production and daily life. In computer vision, many human-centered applications, such as video surveillance, human-computer interaction, digital entertainment, etc., rely heavily on accurate and efficient human pose estimation techniques. Inspired by the remarkable achievements in learning-based 2D human pose estimation, numerous research studies are devoted to the topic of 3D human pose estimation via deep learning methods. Against this backdrop, this paper provides an extensive literature survey of recent literature about deep learning methods for 3D human pose estimation to display the development process of these research studies, track the latest research trends, and analyze the characteristics of devised types of methods. The literature is reviewed, along with the general pipeline of 3D human pose estimation, which consists of human body modeling, learning-based pose estimation, and regularization for refinement. Different from existing reviews of the same topic, this paper focus on deep learning-based methods. The learning-based pose estimation is discussed from two categories: single-person and multi-person. Each one is further categorized by data type to the image-based methods and the video-based methods. Moreover, due to the significance of data for learning-based methods, this paper surveys the 3D human pose estimation methods according to the taxonomy of supervision form. At last, this paper also enlists the current and widely used datasets and compares performances of reviewed methods. Based on this literature survey, it can be concluded that each branch of 3D human pose estimation starts with fully-supervised methods, and there is still much room for multi-person pose estimation based on other supervision methods from both image and video. Besides the significant development of 3D human pose estimation via deep learning, the inherent ambiguity and occlusion problems remain challenging issues that need to be better addressed.https://www.mdpi.com/2079-9292/10/18/22673D human pose estimationdeep learningunsupervisedsemi-supervisedfully-supervisedweakly-supervised
spellingShingle	Dejun Zhang Yiqi Wu Mingyue Guo Yilin Chen Deep Learning Methods for 3D Human Pose Estimation under Different Supervision Paradigms: A Survey Electronics 3D human pose estimation deep learning unsupervised semi-supervised fully-supervised weakly-supervised
title	Deep Learning Methods for 3D Human Pose Estimation under Different Supervision Paradigms: A Survey
title_full	Deep Learning Methods for 3D Human Pose Estimation under Different Supervision Paradigms: A Survey
title_fullStr	Deep Learning Methods for 3D Human Pose Estimation under Different Supervision Paradigms: A Survey
title_full_unstemmed	Deep Learning Methods for 3D Human Pose Estimation under Different Supervision Paradigms: A Survey
title_short	Deep Learning Methods for 3D Human Pose Estimation under Different Supervision Paradigms: A Survey
title_sort	deep learning methods for 3d human pose estimation under different supervision paradigms a survey
topic	3D human pose estimation deep learning unsupervised semi-supervised fully-supervised weakly-supervised
url	https://www.mdpi.com/2079-9292/10/18/2267
work_keys_str_mv	AT dejunzhang deeplearningmethodsfor3dhumanposeestimationunderdifferentsupervisionparadigmsasurvey AT yiqiwu deeplearningmethodsfor3dhumanposeestimationunderdifferentsupervisionparadigmsasurvey AT mingyueguo deeplearningmethodsfor3dhumanposeestimationunderdifferentsupervisionparadigmsasurvey AT yilinchen deeplearningmethodsfor3dhumanposeestimationunderdifferentsupervisionparadigmsasurvey

Deep Learning Methods for 3D Human Pose Estimation under Different Supervision Paradigms: A Survey

Similar Items