A spin glass model for the loss surfaces of generative adversarial networks

We present a novel mathematical model that seeks to capture the key design feature of generative adversarial networks (GANs). Our model consists of two interacting spin glasses, and we conduct an extensive theoretical analysis of the complexity of the model’s critical points using techniques from Ra...

Full description

Bibliographic Details
Main Authors: Baskerville, N, Keating, J, Mezzadri, F, Najnudel, J
Format: Journal article
Language:English
Published: Springer 2022
_version_ 1797054290840256512
author Baskerville, N
Keating, J
Mezzadri, F
Najnudel, J
author_facet Baskerville, N
Keating, J
Mezzadri, F
Najnudel, J
author_sort Baskerville, N
collection OXFORD
description We present a novel mathematical model that seeks to capture the key design feature of generative adversarial networks (GANs). Our model consists of two interacting spin glasses, and we conduct an extensive theoretical analysis of the complexity of the model’s critical points using techniques from Random Matrix Theory. The result is insights into the loss surfaces of large GANs that build upon prior insights for simpler networks, but also reveal new structure unique to this setting which explains the greater difficulty of training GANs.
first_indexed 2024-03-06T18:55:09Z
format Journal article
id oxford-uuid:119653a3-56b5-4177-9bc8-829f659f6b61
institution University of Oxford
language English
last_indexed 2024-03-06T18:55:09Z
publishDate 2022
publisher Springer
record_format dspace
spelling oxford-uuid:119653a3-56b5-4177-9bc8-829f659f6b612022-03-26T10:03:08ZA spin glass model for the loss surfaces of generative adversarial networksJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:119653a3-56b5-4177-9bc8-829f659f6b61EnglishSymplectic ElementsSpringer2022Baskerville, NKeating, JMezzadri, FNajnudel, JWe present a novel mathematical model that seeks to capture the key design feature of generative adversarial networks (GANs). Our model consists of two interacting spin glasses, and we conduct an extensive theoretical analysis of the complexity of the model’s critical points using techniques from Random Matrix Theory. The result is insights into the loss surfaces of large GANs that build upon prior insights for simpler networks, but also reveal new structure unique to this setting which explains the greater difficulty of training GANs.
spellingShingle Baskerville, N
Keating, J
Mezzadri, F
Najnudel, J
A spin glass model for the loss surfaces of generative adversarial networks
title A spin glass model for the loss surfaces of generative adversarial networks
title_full A spin glass model for the loss surfaces of generative adversarial networks
title_fullStr A spin glass model for the loss surfaces of generative adversarial networks
title_full_unstemmed A spin glass model for the loss surfaces of generative adversarial networks
title_short A spin glass model for the loss surfaces of generative adversarial networks
title_sort spin glass model for the loss surfaces of generative adversarial networks
work_keys_str_mv AT baskervillen aspinglassmodelforthelosssurfacesofgenerativeadversarialnetworks
AT keatingj aspinglassmodelforthelosssurfacesofgenerativeadversarialnetworks
AT mezzadrif aspinglassmodelforthelosssurfacesofgenerativeadversarialnetworks
AT najnudelj aspinglassmodelforthelosssurfacesofgenerativeadversarialnetworks
AT baskervillen spinglassmodelforthelosssurfacesofgenerativeadversarialnetworks
AT keatingj spinglassmodelforthelosssurfacesofgenerativeadversarialnetworks
AT mezzadrif spinglassmodelforthelosssurfacesofgenerativeadversarialnetworks
AT najnudelj spinglassmodelforthelosssurfacesofgenerativeadversarialnetworks