Damped Newton Stochastic Gradient Descent Method for Neural Networks Training

Damped Newton Stochastic Gradient Descent Method for Neural Networks Training

First-order methods such as stochastic gradient descent (SGD) have recently become popular optimization methods to train deep neural networks (DNNs) for good generalization; however, they need a long training time. Second-order methods which can lower the training time are scarcely used on account o...

Description complète

Détails bibliographiques
Auteurs principaux:	Jingcheng Zhou, Wei Wei, Ruizhi Zhang, Zhiming Zheng
Format:	Article
Langue:	English
Publié:	MDPI AG 2021-06-01
Collection:	Mathematics
Sujets:	stochastic gradient descent damped Newton convexity
Accès en ligne:	https://www.mdpi.com/2227-7390/9/13/1533

Documents similaires

Adaptive Stochastic Gradient Descent Method for Convex and Non-Convex Optimization
par: Ruijuan Chen, et autres
Publié: (2022-11-01)

The Improved Stochastic Fractional Order Gradient Descent Algorithm
par: Yang Yang, et autres
Publié: (2023-08-01)

Recent Advances in Stochastic Gradient Descent in Deep Learning
par: Yingjie Tian, et autres
Publié: (2023-01-01)

A Geometric Interpretation of Stochastic Gradient Descent Using Diffusion Metrics
par: Rita Fioresi, et autres
Publié: (2020-01-01)

Stochastic gradient descent with random label noises: doubly stochastic models and inference stabilizer
par: Haoyi Xiong, et autres
Publié: (2024-01-01)

Distributed Stochastic Gradient Descent With Compressed and Skipped Communication
par: Tran Thi Phuong, et autres
Publié: (2023-01-01)

Modified‎ ‎Step‎ ‎Size‎ ‎for‎ ‎Enhanced‎ ‎Stochastic Gradient Descent‎: ‎Convergence and Experiments
par: Mahsa Soheil shamaee, et autres
Publié: (2024-09-01)

Pipelined Stochastic Gradient Descent with Taylor Expansion
par: Bongwon Jang, et autres
Publié: (2023-10-01)

A Novel Sine Step Size for Warm-Restart Stochastic Gradient Descent
par: Mahsa Soheil Shamaee, et autres
Publié: (2024-12-01)

Mutual Information Based Learning Rate Decay for Stochastic Gradient Descent Training of Deep Neural Networks
par: Shrihari Vasudevan
Publié: (2020-05-01)

Adaptive Gradient Estimation Stochastic Parallel Gradient Descent Algorithm for Laser Beam Cleanup
par: Shiqing Ma, et autres
Publié: (2021-05-01)

Smoothing gradient descent algorithm for the composite sparse optimization
par: Wei Yang, et autres
Publié: (2024-11-01)

Stochastic Gradient Descent for Kernel-Based Maximum Correntropy Criterion
par: Tiankai Li, et autres
Publié: (2024-12-01)

Design of Momentum Fractional Stochastic Gradient Descent for Recommender Systems
par: Zeshan Aslam Khan, et autres
Publié: (2019-01-01)

Adaptive Stochastic Conjugate Gradient Optimization for Backpropagation Neural Networks
par: Ibrahim Abaker Targio Hashem, et autres
Publié: (2024-01-01)

Counterexamples for Noise Models of Stochastic Gradients
par: Vivak Patel
Publié: (2023-12-01)

Fast Iterative Hybrid Precoding and Combining With Momentum Gradient Descent and Newton’s Method for Millimeter Wave MIMO Systems
par: Mohamed Alouzi, et autres
Publié: (2023-01-01)

Public Security Video Surveillance Image Restoration Based on Stochastic Gradient Descent Algorithm
par: Yuxiao MENG, et autres
Publié: (2022-11-01)

Accelerated Singular Value Decomposition (ASVD) using momentum based Gradient Descent Optimization
par: Sandeep Kumar Raghuwanshi, et autres
Publié: (2021-05-01)

Determination of accelerated factors in gradient descent iterations based on Taylor's series
par: Petrović Milena, et autres
Publié: (2017-01-01)

Semi-Stochastic Gradient Descent Methods
par: Jakub Konečný, et autres
Publié: (2017-05-01)

Estimation of simultaneous equation models by backpropagation method using stochastic gradient descent
par: Belén Pérez-Sánchez, et autres
Publié: (2024-10-01)

Performance Evaluation of Gradient Descent Optimizers in Estuarine Turbidity Estimation with Multilayer Perceptron and Sentinel-2 Imagery
par: Naledzani Ndou, et autres
Publié: (2024-10-01)

Stochastic gradient descent algorithm preserving differential privacy in MapReduce framework
par: Yihan YU, et autres
Publié: (2018-01-01)

Stochastic gradient descent algorithm preserving differential privacy in MapReduce framework
par: Yihan YU, et autres
Publié: (2018-01-01)

A Method for Transforming Non-Convex Optimization Problem to Distributed Form
par: Oleg O. Khamisov, et autres
Publié: (2024-09-01)

Newton's method in the context of gradients
par: John W. Neuberger, et autres
Publié: (2007-09-01)

Hybrid Distributed Optimization for Learning Over Networks With Heterogeneous Agents
par: Mohammad H. Nassralla, et autres
Publié: (2023-01-01)

On some stochastic mirror descent methods for constrained online optimization problems
par: Mohammad S. Alkousa
Publié: (2019-04-01)

Gradient Descent Batch Clustering for Image Classification
par: Jae-Sam Park
Publié: (2023-07-01)

Adaptive Human–Machine Evaluation Framework Using Stochastic Gradient Descent-Based Reinforcement Learning for Dynamic Competing Network
par: Jinbae Kim, et autres
Publié: (2020-04-01)

Distributed stochastic gradient descent for link prediction in signed social networks
par: Han Zhang, et autres
Publié: (2019-01-01)

Belief-Rule-Base Inference Method Based on Gradient Descent With Momentum
par: Yu Guan, et autres
Publié: (2021-01-01)

Training Neural Networks by Time-Fractional Gradient Descent
par: Jingyi Xie, et autres
Publié: (2022-09-01)

Function approximation method based on weights gradient descent in reinforcement learning
par: Xiaoyan QIN, et autres
Publié: (2023-08-01)

Function approximation method based on weights gradient descent in reinforcement learning
par: Xiaoyan QIN, Yuhan LIU, Yunlong XU, Bin LI
Publié: (2023-08-01)

Restoration of Degraded Images Using Pupil-Size Diversity Technology With Stochastic Parallel Gradient Descent Algorithm
par: Zongliang Xie, et autres
Publié: (2016-01-01)

Phase Optimized Computer-Generated Holographic Video Calculation With Frame Interpolation Using Gradient Descent Algorithm
par: Gyeongsu Jin, et autres
Publié: (2024-01-01)

Forest fire risk assessment model optimized by stochastic average gradient descent
par: Zexin Fu, et autres
Publié: (2025-01-01)

aSGD: Stochastic Gradient Descent with Adaptive Batch Size for Every Parameter
par: Haoze Shi, et autres
Publié: (2022-03-01)