Effect of Depth and Width on Local Minima in Deep Learning
© 2019 Massachusetts Institute of Technology. For nonconvex optimization in machine learning, this article proves that every local minimum achieves the globally optimal value of the perturbable gradient basis model at any differentiable point. As a result, nonconvex machine learning is theoretically...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MIT Press - Journals
2021
|
Online Access: | https://hdl.handle.net/1721.1/136181 |