On the Convergence Proof of AMSGrad and a New Version
The adaptive moment estimation algorithm Adam (Kingma and Ba) is a popular optimizer for training deep neural networks. However, Reddi et al. recently showed that the convergence proof of Adam is problematic, and they proposed a variant of Adam, called AMSGrad, as a fix. In this pa...
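For context, AMSGrad modifies Adam by keeping a running elementwise maximum of the second-moment estimate, which prevents the effective per-coordinate step size from growing. The sketch below is a minimal single-step Python illustration of that original AMSGrad update (the function and variable names are our own, not the paper's notation); it is not the corrected variant proposed in this article.

```python
import numpy as np

def amsgrad_step(theta, grad, m, v, v_hat,
                 lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One AMSGrad update (illustrative sketch, not this article's new variant)."""
    m = beta1 * m + (1 - beta1) * grad        # first-moment estimate (as in Adam)
    v = beta2 * v + (1 - beta2) * grad ** 2   # second-moment estimate (as in Adam)
    v_hat = np.maximum(v_hat, v)              # the AMSGrad change: running max of v
    theta = theta - lr * m / (np.sqrt(v_hat) + eps)
    return theta, m, v, v_hat
```

Because `v_hat` is non-decreasing, the denominator never shrinks, which is the property Reddi et al. rely on in their convergence argument for AMSGrad.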
| Main Authors: | Phuong Thi Tran, Le Trieu Phong |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | IEEE, 2019-01-01 |
| Series: | IEEE Access |
| Online Access: | https://ieeexplore.ieee.org/document/8713445/ |
Similar Items
- Unified Algorithm Framework for Nonconvex Stochastic Optimization in Deep Neural Networks
  by: Yini Zhu, et al.
  Published: (2021-01-01)
- The buffered optimization methods for online transfer function identification employed on DEAP actuator
  by: Jakub Bernat, et al.
  Published: (2023-09-01)
- GAPCNN with HyPar: Global Average Pooling convolutional neural network with novel NNLU activation function and HYBRID parallelism
  by: Gousia Habib, et al.
  Published: (2022-11-01)
- A ResNet‐based approach for accurate radiographic diagnosis of knee osteoarthritis
  by: Yu Wang, et al.
  Published: (2022-09-01)
- Communication-Efficient Distributed SGD with Error-Feedback, Revisited
  by: Tran Thi Phuong, et al.
  Published: (2021-04-01)