A Bounded Scheduling Method for Adaptive Gradient Methods

Many adaptive gradient methods have been successfully applied to train deep neural networks, such as Adagrad, Adadelta, RMSprop and Adam. These methods perform local optimization with an element-wise scaling learning rate based on past gradients. Although these methods can achieve an advantageous tr...

Full description

Bibliographic Details
Main Authors: Mingxing Tang, Zhen Huang, Yuan Yuan, Changjian Wang, Yuxing Peng
Format: Article
Language:English
Published: MDPI AG 2019-09-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/9/17/3569