Implicit Semantic Data Augmentation for Hand Pose Estimation

Data augmentation is a well-known technique used for improving the generalization performance of modern neural networks. After the success of several traditional random data augmentation for images (including flipping, translation, or rotation), a recent surge of interest in implicit data augmentati...

Full description

Bibliographic Details
Main Authors: Kyeongeun Seo, Hyeonjoong Cho, Daewoong Choi, Ju-Derk Park
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9853211/
_version_ 1828182149167054848
author Kyeongeun Seo
Hyeonjoong Cho
Daewoong Choi
Ju-Derk Park
author_facet Kyeongeun Seo
Hyeonjoong Cho
Daewoong Choi
Ju-Derk Park
author_sort Kyeongeun Seo
collection DOAJ
description Data augmentation is a well-known technique used for improving the generalization performance of modern neural networks. After the success of several traditional random data augmentation for images (including flipping, translation, or rotation), a recent surge of interest in implicit data augmentation techniques occurs to complement random data augmentation techniques. Implicit data augmentation augments training samples in feature space, rather than in pixel space, resulting in the generation of semantically meaningful data. Several techniques on implicit data augmentation have been introduced for classification tasks. However, few approaches have been introduced for regression tasks with continuous/structured labels, such as object pose estimation. Hence, we are motivated to propose a method for implicit semantic data augmentation for hand pose estimation. By considering semantic distances of hand poses, the proposed method implicitly generates extra training samples in feature space. We propose two additional techniques to improve the performance of this augmentation: metric learning and hand-dependent augmentation. Metric learning aims to learn feature representations to reflect the semantic distance of data. For hand pose estimation, the distribution of augmented hand poses can be regulated by managing the distribution of feature representations. Meanwhile, hand-dependent augmentation is specifically designed for hand pose estimation to prevent semantically meaningless hand poses from being generated (e.g., hands generated by simple interpolation between both hands). Further, we demonstrate the effectiveness of the proposed technique using two well-known hand pose datasets: STB and RHD.
first_indexed 2024-04-12T06:13:36Z
format Article
id doaj.art-3bf0238d17d5442cbf3d4c89a2eb79b7
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-12T06:13:36Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-3bf0238d17d5442cbf3d4c89a2eb79b72022-12-22T03:44:36ZengIEEEIEEE Access2169-35362022-01-0110846808468810.1109/ACCESS.2022.31977499853211Implicit Semantic Data Augmentation for Hand Pose EstimationKyeongeun Seo0Hyeonjoong Cho1https://orcid.org/0000-0003-1487-895XDaewoong Choi2https://orcid.org/0000-0003-3554-9848Ju-Derk Park3Information Media Research Center, Korea Electronics Technology Institute, Seoul, South KoreaDepartment of Computer Convergence Software, Korea University, Sejong, South KoreaDepartment of Computer Convergence Software, Korea University, Sejong, South KoreaElectronics and Telecommunications Research Institute, Daejeon, South KoreaData augmentation is a well-known technique used for improving the generalization performance of modern neural networks. After the success of several traditional random data augmentation for images (including flipping, translation, or rotation), a recent surge of interest in implicit data augmentation techniques occurs to complement random data augmentation techniques. Implicit data augmentation augments training samples in feature space, rather than in pixel space, resulting in the generation of semantically meaningful data. Several techniques on implicit data augmentation have been introduced for classification tasks. However, few approaches have been introduced for regression tasks with continuous/structured labels, such as object pose estimation. Hence, we are motivated to propose a method for implicit semantic data augmentation for hand pose estimation. By considering semantic distances of hand poses, the proposed method implicitly generates extra training samples in feature space. We propose two additional techniques to improve the performance of this augmentation: metric learning and hand-dependent augmentation. Metric learning aims to learn feature representations to reflect the semantic distance of data. For hand pose estimation, the distribution of augmented hand poses can be regulated by managing the distribution of feature representations. Meanwhile, hand-dependent augmentation is specifically designed for hand pose estimation to prevent semantically meaningless hand poses from being generated (e.g., hands generated by simple interpolation between both hands). Further, we demonstrate the effectiveness of the proposed technique using two well-known hand pose datasets: STB and RHD.https://ieeexplore.ieee.org/document/9853211/Hand pose estimationdata augmentationsemantic learningfeature learning
spellingShingle Kyeongeun Seo
Hyeonjoong Cho
Daewoong Choi
Ju-Derk Park
Implicit Semantic Data Augmentation for Hand Pose Estimation
IEEE Access
Hand pose estimation
data augmentation
semantic learning
feature learning
title Implicit Semantic Data Augmentation for Hand Pose Estimation
title_full Implicit Semantic Data Augmentation for Hand Pose Estimation
title_fullStr Implicit Semantic Data Augmentation for Hand Pose Estimation
title_full_unstemmed Implicit Semantic Data Augmentation for Hand Pose Estimation
title_short Implicit Semantic Data Augmentation for Hand Pose Estimation
title_sort implicit semantic data augmentation for hand pose estimation
topic Hand pose estimation
data augmentation
semantic learning
feature learning
url https://ieeexplore.ieee.org/document/9853211/
work_keys_str_mv AT kyeongeunseo implicitsemanticdataaugmentationforhandposeestimation
AT hyeonjoongcho implicitsemanticdataaugmentationforhandposeestimation
AT daewoongchoi implicitsemanticdataaugmentationforhandposeestimation
AT juderkpark implicitsemanticdataaugmentationforhandposeestimation