The Hessian Penalty: A Weak Prior for Unsupervised Disentanglement

Part of the Lecture Notes in Computer Science book series (LNCS, volume 12351)

Bibliographic Details
Main Authors: Peebles, William, Peebles, John, Zhu, Jun-Yan, Efros, Alexei, Torralba, Antonio
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format: Book
Language:English
Published: Springer International Publishing 2021
Online Access:https://hdl.handle.net/1721.1/130353
_version_ 1811091897776603136
author Peebles, William
Peebles, John
Zhu, Jun-Yan
Efros, Alexei
Torralba, Antonio
author2 Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
author_facet Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Peebles, William
Peebles, John
Zhu, Jun-Yan
Efros, Alexei
Torralba, Antonio
author_sort Peebles, William
collection MIT
description Part of the Lecture Notes in Computer Science book series (LNCS, volume 12351)
first_indexed 2024-09-23T15:09:45Z
format Book
id mit-1721.1/130353
institution Massachusetts Institute of Technology
language English
last_indexed 2024-09-23T15:09:45Z
publishDate 2021
publisher Springer International Publishing
record_format dspace
spelling mit-1721.1/1303532022-10-02T01:00:30Z The Hessian Penalty: A Weak Prior for Unsupervised Disentanglement Peebles, William Peebles, John Zhu, Jun-Yan Efros, Alexei Torralba, Antonio Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory Part of the Lecture Notes in Computer Science book series (LNCS, volume 12351) Existing disentanglement methods for deep generative models rely on hand-picked priors and complex encoder-based architectures. In this paper, we propose the Hessian Penalty, a simple regularization term that encourages the Hessian of a generative model with respect to its input to be diagonal. We introduce a model-agnostic, unbiased stochastic approximation of this term based on Hutchinson’s estimator to compute it efficiently during training. Our method can be applied to a wide range of deep generators with just a few lines of code. We show that training with the Hessian Penalty often causes axis-aligned disentanglement to emerge in latent space when applied to ProGAN on several datasets. Additionally, we use our regularization term to identify interpretable directions in BigGAN’s latent space in an unsupervised fashion. Finally, we provide empirical evidence that the Hessian Penalty encourages substantial shrinkage when applied to over-parameterized latent spaces. We encourage readers to view videos of our disentanglement results at www.wpeebles.com/hessian-penalty, and code at https://github.com/wpeebles/hessian_penalty. 2021-04-02T15:45:58Z 2021-04-02T15:45:58Z 2020-11 2021-01-28T16:07:58Z Book http://purl.org/eprint/type/ConferencePaper 9783030585389 9783030585396 0302-9743 1611-3349 https://hdl.handle.net/1721.1/130353 Peebles, William et al. "The Hessian Penalty: A Weak Prior for Unsupervised Disentanglement." ECCV: European Conference on Computer Vision, Lecture Notes in Computer Science, 12351, Springer International Publishing, 2020, 581-597. © 2020, Springer Nature en http://dx.doi.org/10.1007/978-3-030-58539-6_35 Lecture Notes in Computer Science Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf Springer International Publishing arXiv
spellingShingle Peebles, William
Peebles, John
Zhu, Jun-Yan
Efros, Alexei
Torralba, Antonio
The Hessian Penalty: A Weak Prior for Unsupervised Disentanglement
title The Hessian Penalty: A Weak Prior for Unsupervised Disentanglement
title_full The Hessian Penalty: A Weak Prior for Unsupervised Disentanglement
title_fullStr The Hessian Penalty: A Weak Prior for Unsupervised Disentanglement
title_full_unstemmed The Hessian Penalty: A Weak Prior for Unsupervised Disentanglement
title_short The Hessian Penalty: A Weak Prior for Unsupervised Disentanglement
title_sort hessian penalty a weak prior for unsupervised disentanglement
url https://hdl.handle.net/1721.1/130353
work_keys_str_mv AT peebleswilliam thehessianpenaltyaweakpriorforunsuperviseddisentanglement
AT peeblesjohn thehessianpenaltyaweakpriorforunsuperviseddisentanglement
AT zhujunyan thehessianpenaltyaweakpriorforunsuperviseddisentanglement
AT efrosalexei thehessianpenaltyaweakpriorforunsuperviseddisentanglement
AT torralbaantonio thehessianpenaltyaweakpriorforunsuperviseddisentanglement
AT peebleswilliam hessianpenaltyaweakpriorforunsuperviseddisentanglement
AT peeblesjohn hessianpenaltyaweakpriorforunsuperviseddisentanglement
AT zhujunyan hessianpenaltyaweakpriorforunsuperviseddisentanglement
AT efrosalexei hessianpenaltyaweakpriorforunsuperviseddisentanglement
AT torralbaantonio hessianpenaltyaweakpriorforunsuperviseddisentanglement