Implicit dynamic regularization in deep networks
The square loss has been observed to perform well in classification tasks, at least as well as cross-entropy. However, a theoretical justification has been lacking. Here we develop a theoretical analysis for the square loss that also complements the existing asymptotic analysis for the exponential loss.
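The abstract's empirical claim — that minimizing the square loss on classification targets can match cross-entropy — can be illustrated with a minimal sketch. This toy example is an assumption of mine, not the report's experimental setup: it trains the same linear classifier on two separable Gaussian blobs, once with the square loss applied directly to the raw outputs against one-hot targets (as in the square-loss setting) and once with softmax cross-entropy, then compares training accuracy.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: two well-separated 2-D Gaussian blobs, labels 0/1, one-hot targets.
X = np.vstack([rng.normal(-2.0, 1.0, (50, 2)), rng.normal(2.0, 1.0, (50, 2))])
y = np.repeat([0, 1], 50)
Y = np.eye(2)[y]

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def train(loss, lr=0.1, steps=500):
    """Gradient descent on a bias-free linear model with the chosen loss."""
    W = np.zeros((2, 2))
    for _ in range(steps):
        Z = X @ W
        if loss == "square":
            # 0.5 * ||Z - Y||^2 on raw outputs: gradient w.r.t. Z is Z - Y.
            dZ = Z - Y
        else:
            # Softmax cross-entropy: gradient w.r.t. Z is softmax(Z) - Y.
            dZ = softmax(Z) - Y
        W -= lr * X.T @ dZ / len(X)
    return W

for loss in ("square", "cross-entropy"):
    W = train(loss)
    acc = ((X @ W).argmax(axis=1) == y).mean()
    print(f"{loss}: train accuracy {acc:.2f}")
```

On data this separable both losses reach essentially the same training accuracy; the report's contribution is a theoretical account of why the square loss behaves well, which this sketch does not attempt to reproduce.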
Main Authors: Poggio, Tomaso; Liao, Qianli
Format: Technical Report
Published: Center for Brains, Minds and Machines (CBMM), 2020
Online Access: https://hdl.handle.net/1721.1/126653
Similar Items
- Theoretical Issues in Deep Networks, by: Poggio, Tomaso, et al. Published: (2019)
- Theoretical issues in deep networks, by: Poggio, Tomaso, et al. Published: (2021)
- SGD Noise and Implicit Low-Rank Bias in Deep Neural Networks, by: Galanti, Tomer, et al. Published: (2022)
- Complexity control by gradient descent in deep networks, by: Poggio, Tomaso, et al. Published: (2021)
- Complexity control by gradient descent in deep networks, by: Tomaso Poggio, et al. Published: (2020-02-01)