How early can we average Neural Networks?

There is a recurring observation in deep learning that neural networks can be combined simply with arithmetic averages over their parameters. This observation has led to many new research directions in model ensembling, meta-learning, federated learning, and optimization. We investigate the evolutio...

Full description

Bibliographic Details
Main Author: Nasimov, Umarbek
Other Authors: Poggio, Tomaso
Format: Thesis
Published: Massachusetts Institute of Technology 2023
Online Access:https://hdl.handle.net/1721.1/151660