A conditional deep generative model of people in natural images

We propose a deep generative model of humans in natural images which keeps 2D pose separated from other latent factors of variation, such as background scene and clothing. In contrast to methods that learn generative models of low-dimensional representations, e.g., segmentation masks and 2D skeleton...

Full description

Bibliographic Details
Main Authors: De Bem, R, Ghosh, A, Boukhayma, A, Ajanthan, T, Siddharth, N, Torr, P
Format: Conference item
Published: IEEE 2019
_version_ 1826274799274426368
author De Bem, R
Ghosh, A
Boukhayma, A
Ajanthan, T
Siddharth, N
Torr, P
author_facet De Bem, R
Ghosh, A
Boukhayma, A
Ajanthan, T
Siddharth, N
Torr, P
author_sort De Bem, R
collection OXFORD
description We propose a deep generative model of humans in natural images which keeps 2D pose separated from other latent factors of variation, such as background scene and clothing. In contrast to methods that learn generative models of low-dimensional representations, e.g., segmentation masks and 2D skeletons, our single-stage end-to-end conditional-VAEGAN learns directly on the image space. The flexibility of this approach allows the sampling of people with independent variations of pose and appearance. Moreover, it enables the reconstruction of images conditioned to a given posture, allowing, for instance, pose-transfer from one person to another. We validate our method on the Human3.6M dataset and achieve state-of-the-art results on the ChictopiaPlus benchmark. Our model, named Conditional-DGPose, outperforms the closest related work in the literature. It generates more realistic and accurate images regarding both, body posture and image quality, learning the underlying factors of pose and appearance variation.
first_indexed 2024-03-06T22:48:55Z
format Conference item
id oxford-uuid:5e2538d6-7f80-4b74-a95f-c09f3cc69051
institution University of Oxford
last_indexed 2024-03-06T22:48:55Z
publishDate 2019
publisher IEEE
record_format dspace
spelling oxford-uuid:5e2538d6-7f80-4b74-a95f-c09f3cc690512022-03-26T17:38:48ZA conditional deep generative model of people in natural imagesConference itemhttp://purl.org/coar/resource_type/c_5794uuid:5e2538d6-7f80-4b74-a95f-c09f3cc69051Symplectic Elements at OxfordIEEE2019De Bem, RGhosh, ABoukhayma, AAjanthan, TSiddharth, NTorr, PWe propose a deep generative model of humans in natural images which keeps 2D pose separated from other latent factors of variation, such as background scene and clothing. In contrast to methods that learn generative models of low-dimensional representations, e.g., segmentation masks and 2D skeletons, our single-stage end-to-end conditional-VAEGAN learns directly on the image space. The flexibility of this approach allows the sampling of people with independent variations of pose and appearance. Moreover, it enables the reconstruction of images conditioned to a given posture, allowing, for instance, pose-transfer from one person to another. We validate our method on the Human3.6M dataset and achieve state-of-the-art results on the ChictopiaPlus benchmark. Our model, named Conditional-DGPose, outperforms the closest related work in the literature. It generates more realistic and accurate images regarding both, body posture and image quality, learning the underlying factors of pose and appearance variation.
spellingShingle De Bem, R
Ghosh, A
Boukhayma, A
Ajanthan, T
Siddharth, N
Torr, P
A conditional deep generative model of people in natural images
title A conditional deep generative model of people in natural images
title_full A conditional deep generative model of people in natural images
title_fullStr A conditional deep generative model of people in natural images
title_full_unstemmed A conditional deep generative model of people in natural images
title_short A conditional deep generative model of people in natural images
title_sort conditional deep generative model of people in natural images
work_keys_str_mv AT debemr aconditionaldeepgenerativemodelofpeopleinnaturalimages
AT ghosha aconditionaldeepgenerativemodelofpeopleinnaturalimages
AT boukhaymaa aconditionaldeepgenerativemodelofpeopleinnaturalimages
AT ajanthant aconditionaldeepgenerativemodelofpeopleinnaturalimages
AT siddharthn aconditionaldeepgenerativemodelofpeopleinnaturalimages
AT torrp aconditionaldeepgenerativemodelofpeopleinnaturalimages
AT debemr conditionaldeepgenerativemodelofpeopleinnaturalimages
AT ghosha conditionaldeepgenerativemodelofpeopleinnaturalimages
AT boukhaymaa conditionaldeepgenerativemodelofpeopleinnaturalimages
AT ajanthant conditionaldeepgenerativemodelofpeopleinnaturalimages
AT siddharthn conditionaldeepgenerativemodelofpeopleinnaturalimages
AT torrp conditionaldeepgenerativemodelofpeopleinnaturalimages