Recurrent human pose estimation

We propose a ConvNet model for predicting 2D human body poses in an image. The model regresses a heatmap representation for each body keypoint, and is able to learn and represent both the part appearances and the context of the part configuration. We make the following three contributions: (i) an ar...

Полное описание

Библиографические подробности
Главные авторы:	Belagiannis, V, Zisserman, A
Формат:	Conference item
Опубликовано:	Institute of Electrical and Electronics Engineers 2017

_version_	1826304613446320128
author	Belagiannis, V Zisserman, A
author_facet	Belagiannis, V Zisserman, A
author_sort	Belagiannis, V
collection	OXFORD
description	We propose a ConvNet model for predicting 2D human body poses in an image. The model regresses a heatmap representation for each body keypoint, and is able to learn and represent both the part appearances and the context of the part configuration. We make the following three contributions: (i) an architecture combining a feed forward module with a recurrent module, where the recurrent module can be run iteratively to improve the performance; (ii) the model can be trained end-to-end and from scratch, with auxiliary losses incorporated to improve performance; (iii) we investigate whether keypoint visibility can also be predicted. The model is evaluated on two benchmark datasets. The result is a simple architecture that achieves performance on par with the state of the art, but without the complexity of a graphical model stage (or layers).
first_indexed	2024-03-07T06:20:29Z
format	Conference item
id	oxford-uuid:f28ba2c9-b4fb-41b6-9927-79f38b4ce388
institution	University of Oxford
last_indexed	2024-03-07T06:20:29Z
publishDate	2017
publisher	Institute of Electrical and Electronics Engineers
record_format	dspace
spelling	oxford-uuid:f28ba2c9-b4fb-41b6-9927-79f38b4ce3882022-03-27T12:04:41ZRecurrent human pose estimationConference itemhttp://purl.org/coar/resource_type/c_5794uuid:f28ba2c9-b4fb-41b6-9927-79f38b4ce388Symplectic Elements at OxfordInstitute of Electrical and Electronics Engineers2017Belagiannis, VZisserman, AWe propose a ConvNet model for predicting 2D human body poses in an image. The model regresses a heatmap representation for each body keypoint, and is able to learn and represent both the part appearances and the context of the part configuration. We make the following three contributions: (i) an architecture combining a feed forward module with a recurrent module, where the recurrent module can be run iteratively to improve the performance; (ii) the model can be trained end-to-end and from scratch, with auxiliary losses incorporated to improve performance; (iii) we investigate whether keypoint visibility can also be predicted. The model is evaluated on two benchmark datasets. The result is a simple architecture that achieves performance on par with the state of the art, but without the complexity of a graphical model stage (or layers).
spellingShingle	Belagiannis, V Zisserman, A Recurrent human pose estimation
title	Recurrent human pose estimation
title_full	Recurrent human pose estimation
title_fullStr	Recurrent human pose estimation
title_full_unstemmed	Recurrent human pose estimation
title_short	Recurrent human pose estimation
title_sort	recurrent human pose estimation
work_keys_str_mv	AT belagiannisv recurrenthumanposeestimation AT zissermana recurrenthumanposeestimation

Recurrent human pose estimation

Схожие документы