A Bounded Scheduling Method for Adaptive Gradient Methods

Many adaptive gradient methods have been successfully applied to train deep neural networks, such as Adagrad, Adadelta, RMSprop and Adam. These methods perform local optimization with an element-wise scaling learning rate based on past gradients. Although these methods can achieve an advantageous tr...

Full description

Bibliographic Details
Main Authors:	Mingxing Tang, Zhen Huang, Yuan Yuan, Changjian Wang, Yuxing Peng
Format:	Article
Language:	English
Published:	MDPI AG 2019-09-01
Series:	Applied Sciences
Subjects:	deep neural networks adaptive gradient methods stochastic gradient descent bounded scheduling method image classification language modeling
Online Access:	https://www.mdpi.com/2076-3417/9/17/3569

Internet

https://www.mdpi.com/2076-3417/9/17/3569

A Bounded Scheduling Method for Adaptive Gradient Methods

Internet

Similar Items