Accumulated decoupled learning with gradient staleness mitigation for convolutional neural networks
Gradient staleness is a major side effect of decoupled learning when training convolutional neural networks asynchronously. Existing methods that ignore this effect can suffer from reduced generalization and even divergence. In this paper, we propose an accumulated decoupled learning (ADL), wh...
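The record's abstract is truncated, so the sketch below is only a hypothetical illustration of the general idea it names: accumulating gradients over several micro-batches before applying a parameter update, in a model split into modules, so that the averaging damps the noise contributed by any single stale gradient. It is not the authors' ADL algorithm; the module split, the accumulation length `K`, the optimizer, and the synthetic data are all assumptions.

```python
# Hypothetical sketch of module-wise gradient accumulation (not the ADL method
# from the paper): accumulate gradients over K micro-batches, then update once.
import torch
import torch.nn as nn

torch.manual_seed(0)

# Two "decoupled" modules of a small CNN; in a real decoupled pipeline each
# module would run on its own worker and receive gradients with some delay.
module1 = nn.Sequential(nn.Conv2d(3, 8, 3, padding=1), nn.ReLU(), nn.Flatten())
module2 = nn.Sequential(nn.Linear(8 * 32 * 32, 10))

opt1 = torch.optim.SGD(module1.parameters(), lr=0.05)
opt2 = torch.optim.SGD(module2.parameters(), lr=0.05)
loss_fn = nn.CrossEntropyLoss()

K = 4  # assumed number of micro-batches accumulated per update

opt1.zero_grad()
opt2.zero_grad()
for step in range(1, 8 * K + 1):
    x = torch.randn(16, 3, 32, 32)        # stand-in data
    y = torch.randint(0, 10, (16,))       # stand-in labels

    h = module1(x)                        # forward through module 1
    logits = module2(h)                   # forward through module 2
    loss = loss_fn(logits, y) / K         # scale so accumulated grads average

    loss.backward()                       # gradients accumulate in .grad buffers

    if step % K == 0:
        # One update per K micro-batches: averaging over K gradients reduces
        # the influence of any single stale or noisy gradient.
        opt1.step()
        opt2.step()
        opt1.zero_grad()
        opt2.zero_grad()
```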
Main Authors: Zhuang, Huiping; Weng, Zhenyu; Luo, Fulin; Toh, Kar-Ann; Li, Haizhou; Lin, Zhiping
Other Authors: School of Electrical and Electronic Engineering
Format: Conference Paper
Language: English
Published: 2024
Online Access: https://hdl.handle.net/10356/174480
https://icml.cc/virtual/2021/index.html
Similar Items
- Spoofing speech detection using temporal convolutional neural network
  by: Xiao, Xiong, et al.
  Published: (2018)
- An energy-efficient convolution unit for depthwise separable convolutional neural networks
  by: Chong, Yi Sheng, et al.
  Published: (2021)
- Stability analysis of delayed neural networks via compound-parameter-based integral inequality
  by: Xue, Wenlong, et al.
  Published: (2024)
- Annual dilated convolution neural network for newbuilding ship prices forecasting
  by: Gao, Ruobin, et al.
  Published: (2022)
- Ultra-high-speed accelerator architecture for convolutional neural network based on processing-in-memory using resistive random access memory
  by: Wang, Hongzhe, et al.
  Published: (2023)