Musings on Deep Learning: Properties of SGD
[previously titled "Theory of Deep Learning III: Generalization Properties of SGD"] In Theory III we characterize with a mix of theory and experiments the generalization properties of Stochastic Gradient Descent in overparametrized deep convolutional networks. We show that Stochastic Gradi...
Full description
Bibliographic Details
Main Authors: |
Zhang, Chiyuan,
Liao, Qianli,
Rakhlin, Alexander,
Sridharan, Karthik,
Miranda, Brando,
Golowich, Noah,
Poggio, Tomaso |
Format: | Technical Report
|
Language: | en_US |
Published: |
Center for Brains, Minds and Machines (CBMM)
2017
|
Online Access: | http://hdl.handle.net/1721.1/107841
|