Semantic photo manipulation with a generative image prior

© 2019 Association for Computing Machinery. All rights reserved. Despite the recent success of GANs in synthesizing images conditioned on inputs such as a user sketch, text, or semantic labels, manipulating the highlevel attributes of an existing natural photograph with GANs is challenging for two r...

Full description

Bibliographic Details
Main Authors: Bau, David, Strobelt, Hendrik, Peebles, William, Wulff, Jonas, Zhou, Bolei, Zhu, Jun-Yan, Torralba, Antonio
Other Authors: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Format: Article
Language:English
Published: Association for Computing Machinery (ACM) 2021
Online Access:https://hdl.handle.net/1721.1/136406
_version_ 1811097093322833920
author Bau, David
Strobelt, Hendrik
Peebles, William
Wulff, Jonas
Zhou, Bolei
Zhu, Jun-Yan
Torralba, Antonio
author2 Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
author_facet Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Bau, David
Strobelt, Hendrik
Peebles, William
Wulff, Jonas
Zhou, Bolei
Zhu, Jun-Yan
Torralba, Antonio
author_sort Bau, David
collection MIT
description © 2019 Association for Computing Machinery. All rights reserved. Despite the recent success of GANs in synthesizing images conditioned on inputs such as a user sketch, text, or semantic labels, manipulating the highlevel attributes of an existing natural photograph with GANs is challenging for two reasons. First, it is hard for GANs to precisely reproduce an input image. Second, after manipulation, the newly synthesized pixels often do not fit the original image. In this paper, we address these issues by adapting the image prior learned by GANs to image statistics of an individual image. Our method can accurately reconstruct the input image and synthesize new content, consistent with the appearance of the input image. We demonstrate our interactive system on several semantic image editing tasks, including synthesizing new objects consistent with background, removing unwanted objects, and changing the appearance of an object. Quantitative and qualitative comparisons against several existing methods demonstrate the effectiveness of our method.
first_indexed 2024-09-23T16:54:13Z
format Article
id mit-1721.1/136406
institution Massachusetts Institute of Technology
language English
last_indexed 2024-09-23T16:54:13Z
publishDate 2021
publisher Association for Computing Machinery (ACM)
record_format dspace
spelling mit-1721.1/1364062023-01-20T20:42:19Z Semantic photo manipulation with a generative image prior Bau, David Strobelt, Hendrik Peebles, William Wulff, Jonas Zhou, Bolei Zhu, Jun-Yan Torralba, Antonio Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory MIT-IBM Watson AI Lab © 2019 Association for Computing Machinery. All rights reserved. Despite the recent success of GANs in synthesizing images conditioned on inputs such as a user sketch, text, or semantic labels, manipulating the highlevel attributes of an existing natural photograph with GANs is challenging for two reasons. First, it is hard for GANs to precisely reproduce an input image. Second, after manipulation, the newly synthesized pixels often do not fit the original image. In this paper, we address these issues by adapting the image prior learned by GANs to image statistics of an individual image. Our method can accurately reconstruct the input image and synthesize new content, consistent with the appearance of the input image. We demonstrate our interactive system on several semantic image editing tasks, including synthesizing new objects consistent with background, removing unwanted objects, and changing the appearance of an object. Quantitative and qualitative comparisons against several existing methods demonstrate the effectiveness of our method. 2021-10-27T20:35:13Z 2021-10-27T20:35:13Z 2019 2021-04-15T17:32:39Z Article http://purl.org/eprint/type/JournalArticle https://hdl.handle.net/1721.1/136406 en 10.1145/3306346.3323023 ACM Transactions on Graphics Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf Association for Computing Machinery (ACM) arXiv
spellingShingle Bau, David
Strobelt, Hendrik
Peebles, William
Wulff, Jonas
Zhou, Bolei
Zhu, Jun-Yan
Torralba, Antonio
Semantic photo manipulation with a generative image prior
title Semantic photo manipulation with a generative image prior
title_full Semantic photo manipulation with a generative image prior
title_fullStr Semantic photo manipulation with a generative image prior
title_full_unstemmed Semantic photo manipulation with a generative image prior
title_short Semantic photo manipulation with a generative image prior
title_sort semantic photo manipulation with a generative image prior
url https://hdl.handle.net/1721.1/136406
work_keys_str_mv AT baudavid semanticphotomanipulationwithagenerativeimageprior
AT strobelthendrik semanticphotomanipulationwithagenerativeimageprior
AT peebleswilliam semanticphotomanipulationwithagenerativeimageprior
AT wulffjonas semanticphotomanipulationwithagenerativeimageprior
AT zhoubolei semanticphotomanipulationwithagenerativeimageprior
AT zhujunyan semanticphotomanipulationwithagenerativeimageprior
AT torralbaantonio semanticphotomanipulationwithagenerativeimageprior