Sim2real transfer learning for 3D human pose estimation: motion to the rescue

Synthetic visual data can provide practicically infinite diversity and rich labels, while avoiding ethical issues with privacy and bias. However, for many tasks, current models trained on synthetic data generalize poorly to real data. The task of 3D human pose estimation is a particularly interestin...

Full description

Bibliographic Details
Main Authors:	Doersch, C, Zisserman, A
Format:	Conference item
Language:	English
Published:	Curran Associates 2020

_version_	1826267215933997056
author	Doersch, C Zisserman, A
author_facet	Doersch, C Zisserman, A
author_sort	Doersch, C
collection	OXFORD
description	Synthetic visual data can provide practicically infinite diversity and rich labels, while avoiding ethical issues with privacy and bias. However, for many tasks, current models trained on synthetic data generalize poorly to real data. The task of 3D human pose estimation is a particularly interesting example of this sim2real problem, because learning-based approaches perform reasonably well given real training data, yet labeled 3D poses are extremely difficult to obtain in the wild, limiting scalability. In this paper, we show that standard neural-network approaches, which perform poorly when trained on synthetic RGB images, can perform well when the data is pre-processed to extract cues about the person’s motion, notably as optical flow and the motion of 2D keypoints. Therefore, our results suggest that motion can be a simple way to bridge a sim2real gap when video is available. We evaluate on the 3D Poses in the Wild dataset, the most challenging modern benchmark for 3D pose estimation, where we show full 3D mesh recovery that is on par with state-of-the-art methods trained on real 3D sequences, despite training only on synthetic humans from the SURREAL dataset.
first_indexed	2024-03-06T20:50:47Z
format	Conference item
id	oxford-uuid:378006c0-e772-481b-a1ea-d0dd2db5a403
institution	University of Oxford
language	English
last_indexed	2024-03-06T20:50:47Z
publishDate	2020
publisher	Curran Associates
record_format	dspace
spelling	oxford-uuid:378006c0-e772-481b-a1ea-d0dd2db5a4032022-03-26T13:44:28ZSim2real transfer learning for 3D human pose estimation: motion to the rescueConference itemhttp://purl.org/coar/resource_type/c_5794uuid:378006c0-e772-481b-a1ea-d0dd2db5a403EnglishSymplectic Elements Curran Associates2020Doersch, CZisserman, ASynthetic visual data can provide practicically infinite diversity and rich labels, while avoiding ethical issues with privacy and bias. However, for many tasks, current models trained on synthetic data generalize poorly to real data. The task of 3D human pose estimation is a particularly interesting example of this sim2real problem, because learning-based approaches perform reasonably well given real training data, yet labeled 3D poses are extremely difficult to obtain in the wild, limiting scalability. In this paper, we show that standard neural-network approaches, which perform poorly when trained on synthetic RGB images, can perform well when the data is pre-processed to extract cues about the person’s motion, notably as optical flow and the motion of 2D keypoints. Therefore, our results suggest that motion can be a simple way to bridge a sim2real gap when video is available. We evaluate on the 3D Poses in the Wild dataset, the most challenging modern benchmark for 3D pose estimation, where we show full 3D mesh recovery that is on par with state-of-the-art methods trained on real 3D sequences, despite training only on synthetic humans from the SURREAL dataset.
spellingShingle	Doersch, C Zisserman, A Sim2real transfer learning for 3D human pose estimation: motion to the rescue
title	Sim2real transfer learning for 3D human pose estimation: motion to the rescue
title_full	Sim2real transfer learning for 3D human pose estimation: motion to the rescue
title_fullStr	Sim2real transfer learning for 3D human pose estimation: motion to the rescue
title_full_unstemmed	Sim2real transfer learning for 3D human pose estimation: motion to the rescue
title_short	Sim2real transfer learning for 3D human pose estimation: motion to the rescue
title_sort	sim2real transfer learning for 3d human pose estimation motion to the rescue
work_keys_str_mv	AT doerschc sim2realtransferlearningfor3dhumanposeestimationmotiontotherescue AT zissermana sim2realtransferlearningfor3dhumanposeestimationmotiontotherescue

Sim2real transfer learning for 3D human pose estimation: motion to the rescue

Similar Items