Human pose estimation using artificial intelligence

Human Pose Estimation (HPE), is recently a popular research topic in computer vision field. The object of this task is to generate a skeleton-like representation of a human body and then process it further for task-specific applications. Due to the outstanding performance of vision transformer in ot...

Full description

Bibliographic Details
Main Author: Dai, Yue
Other Authors: Yap Kim Hui
Format: Thesis-Master by Coursework
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/159563
_version_ 1811687023125200896
author Dai, Yue
author2 Yap Kim Hui
author_facet Yap Kim Hui
Dai, Yue
author_sort Dai, Yue
collection NTU
description Human Pose Estimation (HPE), is recently a popular research topic in computer vision field. The object of this task is to generate a skeleton-like representation of a human body and then process it further for task-specific applications. Due to the outstanding performance of vision transformer in other Computer Vision (CV) tasks, its application to HPE problems and ability to surpass Convolutional Neural Network (CNN) models is gaining attention. Apart from the deep learning model, data processing in HPE cannot be ignored. Through the coordinate system transformation in common HPE solutions, the accuracy degrades when flipping strategy is applied in inference. The bias is derived from using resolution measured in pixels instead of size measured in unit length when performing resizing transformation. In this dissertation, TransPose, one state-of-the-art model introducing transformer for HPE is studied. The model is trained and tested on COCO benchmark. The modified TransPose model with unbiased approach is experimented as well. Based on the results, the dependency function of attention map is analyzed. The insights gained from this work provides valuable insight for future exploration of transformer applications and data processing in human pose estimation.
first_indexed 2024-10-01T05:09:43Z
format Thesis-Master by Coursework
id ntu-10356/159563
institution Nanyang Technological University
language English
last_indexed 2024-10-01T05:09:43Z
publishDate 2022
publisher Nanyang Technological University
record_format dspace
spelling ntu-10356/1595632022-06-27T06:13:42Z Human pose estimation using artificial intelligence Dai, Yue Yap Kim Hui School of Electrical and Electronic Engineering EKHYap@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Human Pose Estimation (HPE), is recently a popular research topic in computer vision field. The object of this task is to generate a skeleton-like representation of a human body and then process it further for task-specific applications. Due to the outstanding performance of vision transformer in other Computer Vision (CV) tasks, its application to HPE problems and ability to surpass Convolutional Neural Network (CNN) models is gaining attention. Apart from the deep learning model, data processing in HPE cannot be ignored. Through the coordinate system transformation in common HPE solutions, the accuracy degrades when flipping strategy is applied in inference. The bias is derived from using resolution measured in pixels instead of size measured in unit length when performing resizing transformation. In this dissertation, TransPose, one state-of-the-art model introducing transformer for HPE is studied. The model is trained and tested on COCO benchmark. The modified TransPose model with unbiased approach is experimented as well. Based on the results, the dependency function of attention map is analyzed. The insights gained from this work provides valuable insight for future exploration of transformer applications and data processing in human pose estimation. Master of Science (Signal Processing) 2022-06-27T06:13:41Z 2022-06-27T06:13:41Z 2022 Thesis-Master by Coursework Dai, Y. (2022). Human pose estimation using artificial intelligence. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/159563 https://hdl.handle.net/10356/159563 en application/pdf Nanyang Technological University
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Dai, Yue
Human pose estimation using artificial intelligence
title Human pose estimation using artificial intelligence
title_full Human pose estimation using artificial intelligence
title_fullStr Human pose estimation using artificial intelligence
title_full_unstemmed Human pose estimation using artificial intelligence
title_short Human pose estimation using artificial intelligence
title_sort human pose estimation using artificial intelligence
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
url https://hdl.handle.net/10356/159563
work_keys_str_mv AT daiyue humanposeestimationusingartificialintelligence