Activation function design for deep networks: linearity and effective initialisation
The activation function deployed in a deep neural network has great influence on the performance of the network at initialisation, which in turn has implications for training. In this paper we study how to avoid two problems at initialisation identified in prior works: rapid convergence of pairwise...
Main Authors: | , , |
---|---|
Format: | Journal article |
Language: | English |
Published: |
Elsevier
2022
|