Mutual Information Based Learning Rate Decay for Stochastic Gradient Descent Training of Deep Neural Networks
This paper presents a novel approach to training deep neural networks: a Mutual Information (MI)-driven, decaying Learning Rate (LR) variant of Stochastic Gradient Descent (SGD). The MI between the network's output and the true outcomes is used to adaptively set the LR for the network,...
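The abstract is truncated here, and the paper's exact decay schedule is only available through the access link below. As an illustration only, the following minimal NumPy sketch captures the general idea: estimate the MI between the network's predictions and the true labels on a batch, then decay the LR when that MI plateaus. The function names, the plateau test, and the decay factor are assumptions made for this sketch, not the paper's published rule.

```python
import numpy as np

def batch_mutual_information(probs, labels, eps=1e-12):
    """Estimate MI (in nats) between the true label and the network's
    predicted class distribution from one batch of softmax outputs."""
    n, k = probs.shape
    joint = np.zeros((k, k))                    # joint[y, c] ~ p(true=y, pred=c)
    for p, y in zip(probs, labels):
        joint[y] += p / n
    p_true = joint.sum(axis=1, keepdims=True)   # marginal of true labels
    p_pred = joint.sum(axis=0, keepdims=True)   # marginal of predictions
    return float(np.sum(joint * np.log(joint / (p_true @ p_pred + eps) + eps)))

def mi_decayed_lr(lr, mi_prev, mi_curr, decay=0.5, tol=1e-3):
    """Hypothetical rule (not from the paper): halve the LR whenever
    the batch MI stops improving by at least tol."""
    return lr * decay if mi_curr - mi_prev < tol else lr

# Toy usage: well-separated predictions yield a high batch MI.
probs = np.array([[0.9, 0.1], [0.2, 0.8], [0.85, 0.15], [0.1, 0.9]])
labels = np.array([0, 1, 0, 1])
mi = batch_mutual_information(probs, labels)
lr = mi_decayed_lr(lr=0.1, mi_prev=mi, mi_curr=mi)  # MI plateau -> LR decays to 0.05
```

In a training loop, the MI estimate would be recomputed each epoch (or each validation pass) and fed to the decay rule in place of a fixed step schedule; how often to measure MI and how aggressively to decay are design choices the paper itself addresses.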
Main Author: | Shrihari Vasudevan |
---|---|
Format: | Article |
Language: | English |
Published: | MDPI AG, 2020-05-01 |
Series: | Entropy |
Subjects: | |
Online Access: | https://www.mdpi.com/1099-4300/22/5/560 |
Similar Items
- Damped Newton Stochastic Gradient Descent Method for Neural Networks Training
  by: Jingcheng Zhou, et al.
  Published: (2021-06-01)
- Recent Advances in Stochastic Gradient Descent in Deep Learning
  by: Yingjie Tian, et al.
  Published: (2023-01-01)
- Adaptive Stochastic Conjugate Gradient Optimization for Backpropagation Neural Networks
  by: Ibrahim Abaker Targio Hashem, et al.
  Published: (2024-01-01)
- A Geometric Interpretation of Stochastic Gradient Descent Using Diffusion Metrics
  by: Rita Fioresi, et al.
  Published: (2020-01-01)
- Adaptive Stochastic Gradient Descent Method for Convex and Non-Convex Optimization
  by: Ruijuan Chen, et al.
  Published: (2022-11-01)