Activation function design for deep networks: linearity and effective initialisation

The activation function deployed in a deep neural network has great influence on the performance of the network at initialisation, which in turn has implications for training. In this paper we study how to avoid two problems at initialisation identified in prior works: rapid convergence of pairwise...

Full description

Bibliographic Details
Main Authors: Murray, M, Abrol, V, Tanner, J
Format: Journal article
Language:English
Published: Elsevier 2022