Disentangling the latent space of GANs for semantic face editing.

Disentanglement research is a critical and important issue in the field of image editing. In order to perform disentangled editing on images generated by generative models, this paper presents an unsupervised, model-agnostic, two-stage trained editing framework. This work addresses the problem of di...

Full description

Bibliographic Details
Main Authors: Yongjie Niu, Mingquan Zhou, Zhan Li
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2023-01-01
Series:PLoS ONE
Online Access:https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0293496&type=printable
_version_ 1797635230674190336
author Yongjie Niu
Mingquan Zhou
Zhan Li
author_facet Yongjie Niu
Mingquan Zhou
Zhan Li
author_sort Yongjie Niu
collection DOAJ
description Disentanglement research is a critical and important issue in the field of image editing. In order to perform disentangled editing on images generated by generative models, this paper presents an unsupervised, model-agnostic, two-stage trained editing framework. This work addresses the problem of discovering interpretable, disentangled directions of edited image attributes in the latent space of generative models. This effort's primary objective was to address the limitations discovered in previous research, mainly (a) the discovered editing directions are interpretable but significantly entangled, i.e., changes to one attribute affect the others and (b) Prior research has utilized direction discovery and direction disentanglement separately, and they can't work synergistically. More specifically, this paper proposes a two-stage training method that discovers the editing direction with semantics, perturbs the dimension of the direction vector, adjusts it with a penalty mechanism, and makes the editing direction more disentangled. This allows easy distinguishable image editing, such as age and facial expressions in facial images. Experimentally compared to other methods, the proposed method outperforms them both qualitatively and quantitatively in terms of interpretability, disentanglement, and distinguishability of the generated images. The implementation of our method is available at https://github.com/ydniuyongjie/twoStageForFaceEdit.
first_indexed 2024-03-11T12:19:13Z
format Article
id doaj.art-e463d0de9d644163a6c337bd50975a2e
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-03-11T12:19:13Z
publishDate 2023-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-e463d0de9d644163a6c337bd50975a2e2023-11-07T05:34:59ZengPublic Library of Science (PLoS)PLoS ONE1932-62032023-01-011810e029349610.1371/journal.pone.0293496Disentangling the latent space of GANs for semantic face editing.Yongjie NiuMingquan ZhouZhan LiDisentanglement research is a critical and important issue in the field of image editing. In order to perform disentangled editing on images generated by generative models, this paper presents an unsupervised, model-agnostic, two-stage trained editing framework. This work addresses the problem of discovering interpretable, disentangled directions of edited image attributes in the latent space of generative models. This effort's primary objective was to address the limitations discovered in previous research, mainly (a) the discovered editing directions are interpretable but significantly entangled, i.e., changes to one attribute affect the others and (b) Prior research has utilized direction discovery and direction disentanglement separately, and they can't work synergistically. More specifically, this paper proposes a two-stage training method that discovers the editing direction with semantics, perturbs the dimension of the direction vector, adjusts it with a penalty mechanism, and makes the editing direction more disentangled. This allows easy distinguishable image editing, such as age and facial expressions in facial images. Experimentally compared to other methods, the proposed method outperforms them both qualitatively and quantitatively in terms of interpretability, disentanglement, and distinguishability of the generated images. The implementation of our method is available at https://github.com/ydniuyongjie/twoStageForFaceEdit.https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0293496&type=printable
spellingShingle Yongjie Niu
Mingquan Zhou
Zhan Li
Disentangling the latent space of GANs for semantic face editing.
PLoS ONE
title Disentangling the latent space of GANs for semantic face editing.
title_full Disentangling the latent space of GANs for semantic face editing.
title_fullStr Disentangling the latent space of GANs for semantic face editing.
title_full_unstemmed Disentangling the latent space of GANs for semantic face editing.
title_short Disentangling the latent space of GANs for semantic face editing.
title_sort disentangling the latent space of gans for semantic face editing
url https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0293496&type=printable
work_keys_str_mv AT yongjieniu disentanglingthelatentspaceofgansforsemanticfaceediting
AT mingquanzhou disentanglingthelatentspaceofgansforsemanticfaceediting
AT zhanli disentanglingthelatentspaceofgansforsemanticfaceediting