Training Neural Networks by Time-Fractional Gradient Descent

Motivated by the weighted averaging method for training neural networks, we study the time-fractional gradient descent (TFGD) method based on the time-fractional gradient flow and explore the influence of memory dependence on neural network training. The TFGD algorithm in this paper is studied via t...

Bibliographic Details
Main Authors: Jingyi Xie, Sirui Li
Format: Article
Language: English
Published: MDPI AG 2022-09-01
Series: Axioms
Online Access: https://www.mdpi.com/2075-1680/11/10/507