Norm-Based Generalization Bounds for Compositionally Sparse Neural Networks
In this paper, we investigate the Rademacher complexity of deep sparse neural networks, where each neuron receives a small number of inputs. We prove generalization bounds for multilayered sparse ReLU neural networks, including convolutional neural networks. These bounds differ from previous ones, a...
Main Authors: | Galanti, Tomer; Xu, Mengjia; Galanti, Liane; Poggio, Tomaso
---|---
Format: | Article
Published: | Center for Brains, Minds and Machines (CBMM), 2023
Online Access: | https://hdl.handle.net/1721.1/148230
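To make the setting in the abstract concrete, here is a minimal sketch of a compositionally sparse ReLU network, i.e., one in which each neuron reads only a small, fixed number of inputs from the previous layer; convolutional networks are the weight-sharing special case the abstract mentions. This is an illustrative sketch only, not the paper's construction: the class name `SparseReLUNet` and all depths, widths, and fan-in values are hypothetical choices.

```python
# A minimal sketch (not the paper's construction) of a compositionally
# sparse ReLU network. Sparsity is realized with 1D convolutions, whose
# kernel size bounds each neuron's fan-in; all sizes are illustrative.
import torch
import torch.nn as nn

class SparseReLUNet(nn.Module):
    def __init__(self, depth: int = 3, channels: int = 8, fan_in: int = 3):
        super().__init__()
        layers = []
        in_ch = 1
        for _ in range(depth):
            # kernel_size = fan_in: every output neuron depends on at most
            # fan_in * in_ch activations of the previous layer.
            layers += [nn.Conv1d(in_ch, channels, kernel_size=fan_in), nn.ReLU()]
            in_ch = channels
        self.features = nn.Sequential(*layers)
        self.head = nn.Linear(channels, 1)

    def forward(self, x):
        h = self.features(x)             # (batch, channels, length')
        return self.head(h.mean(dim=2))  # pool over positions, then score

net = SparseReLUNet()
print(net(torch.randn(4, 1, 32)).shape)  # torch.Size([4, 1])
```

Norm-based bounds of the kind the abstract describes control the Rademacher complexity of such a class through the norms of the layer weights together with the per-neuron fan-in, rather than through the total parameter count.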
Similar Items

- SGD Noise and Implicit Low-Rank Bias in Deep Neural Networks
  by: Galanti, Tomer, et al.
  Published: (2022)
- Formation of Representations in Neural Networks
  by: Ziyin, Liu, et al.
  Published: (2024)
- SGD and Weight Decay Provably Induce a Low-Rank Bias in Deep Neural Networks
  by: Galanti, Tomer, et al.
  Published: (2023)
- The Janus effects of SGD vs GD: high noise and low rank
  by: Xu, Mengjia, et al.
  Published: (2023)
- On Generalization Bounds for Neural Networks with Low Rank Layers
  by: Pinto, Andrea, et al.
  Published: (2024)