An Empirical and Theoretical Analysis of the Role of Depth in Convolutional Neural Networks

While over-parameterized neural networks are capable of perfectly fitting (interpolating) training data, these networks often perform well on test data, thereby contradicting classical learning theory. Recent work provided an explanation for this phenomenon by introducing the double descent curve, s...

Bibliographic Details
Main Author: Nichani, Eshaan
Other Authors: Uhler, Caroline
Format: Thesis
Published: Massachusetts Institute of Technology 2022
Online Access: https://hdl.handle.net/1721.1/139174