A Progressive Deep Neural Network Training Method for Image Classification with Noisy Labels

Deep neural networks (DNNs) require large amounts of labeled data for model training. However, label noise is a common problem in datasets due to the difficulty of classification and high cost of labeling processes. Introducing the concepts of curriculum learning and progressive learning, this paper...

Bibliographic Details
Main Authors:	Xuguo Yan, Xuhui Xia, Lei Wang, Zelin Zhang
Format:	Article
Language:	English
Published:	MDPI AG 2022-12-01
Series:	Applied Sciences
Subjects:	DNNs; curriculum learning; progressive learning; noisy labels; image classification
Online Access:	https://www.mdpi.com/2076-3417/12/24/12754

_version_	1827641986543255552
author	Xuguo Yan Xuhui Xia Lei Wang Zelin Zhang
author_facet	Xuguo Yan Xuhui Xia Lei Wang Zelin Zhang
author_sort	Xuguo Yan
collection	DOAJ
description	Deep neural networks (DNNs) require large amounts of labeled data for model training. However, label noise is a common problem in datasets because classification is difficult and labeling is costly. Drawing on the concepts of curriculum learning and progressive learning, this paper presents a novel solution that can handle large numbers of noisy labels and improve model generalization ability. It proposes a new network training strategy that accounts for mislabeled samples directly in the training process. The new learning curriculum is designed to measure the complexity of the data by their distribution density in a feature space. The sample data in each category are then divided into easy-to-classify (clean samples), relatively easy-to-classify, and hard-to-classify (noisy samples) subsets according to the smallest intra-class local density within each cluster. On this basis, DNNs are trained progressively in three stages, from easy to hard, i.e., from clean to noisy samples. The experimental results demonstrate that the accuracy of image classification can be improved through data augmentation, and that the classification accuracy of the proposed method is clearly higher than that of standard Inception_v2 on the NEU dataset after data augmentation when the proportion of noisy labels in the training set does not exceed 60%. With 50% noisy labels in the training set, the proposed method outperformed two recent state-of-the-art label noise learning methods, CleanNet and MentorNet, in classification accuracy. The proposed method also performed well in practical applications, where the number of noisy labels was uncertain and unevenly distributed. In this case, the proposed method can not only alleviate the adverse effects of noisy labels but also improve the generalization ability of standard deep networks and their overall capability.
first_indexed	2024-03-09T17:22:19Z
format	Article
id	doaj.art-a18089f78061431ea3d22bceb93d9899
institution	Directory Open Access Journal
issn	2076-3417
language	English
last_indexed	2024-03-09T17:22:19Z
publishDate	2022-12-01
publisher	MDPI AG
record_format	Article
series	Applied Sciences
spelling	doaj.art-a18089f78061431ea3d22bceb93d9899; 2023-11-24T13:04:11Z; eng; MDPI AG; Applied Sciences; 2076-3417; 2022-12-01; 12; 24; 12754; 10.3390/app122412754; A Progressive Deep Neural Network Training Method for Image Classification with Noisy Labels; Xuguo Yan, Xuhui Xia, Lei Wang, Zelin Zhang; Key Laboratory of Metallurgical Equipment and Control Technology, Ministry of Education, Wuhan University of Science and Technology, Wuhan 430081, China (affiliation listed for all four authors); https://www.mdpi.com/2076-3417/12/24/12754; DNNs; curriculum learning; progressive learning; noisy labels; image classification
title	A Progressive Deep Neural Network Training Method for Image Classification with Noisy Labels
topic	DNNs; curriculum learning; progressive learning; noisy labels; image classification
url	https://www.mdpi.com/2076-3417/12/24/12754
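
The description in this record outlines a density-based curriculum: samples in each class are ranked by local density in a feature space, split into three subsets, and used in three progressive training stages. The sketch below illustrates that idea only and is not the authors' implementation. It assumes features have already been extracted by some network, uses a simple neighbor count within a cutoff distance as the local density, and splits each class into equal thirds; the function names, the `cutoff_percentile` parameter, and the equal-thirds rule are all hypothetical choices, since the record does not give the paper's clustering procedure or thresholds.

```python
import numpy as np


def local_density(features: np.ndarray, cutoff_percentile: float = 50.0) -> np.ndarray:
    """Count, per sample, the other samples lying within a cutoff distance.

    Assumes `features` holds at least two samples of a single class.
    The pairwise-distance matrix is O(n^2), so this suits small classes only.
    """
    diffs = features[:, None, :] - features[None, :, :]
    dist = np.sqrt((diffs ** 2).sum(axis=-1))
    # Cutoff distance: a percentile of the distinct pairwise distances (assumed rule).
    upper = np.triu_indices(len(features), k=1)
    d_c = np.percentile(dist[upper], cutoff_percentile)
    # Subtract 1 so that a sample does not count itself as a neighbor.
    return (dist <= d_c).sum(axis=1) - 1


def split_by_density(features: np.ndarray, n_subsets: int = 3,
                     cutoff_percentile: float = 50.0) -> list:
    """Rank one class by density (densest first) and cut it into subsets."""
    density = local_density(features, cutoff_percentile)
    order = np.argsort(-density, kind="stable")
    # Equal-sized subsets are an assumption, not the paper's criterion.
    return [np.sort(part) for part in np.array_split(order, n_subsets)]


def curriculum_stages(features, labels, cutoff_percentile: float = 50.0) -> list:
    """Return three cumulative index sets: easy, easy + medium, all samples."""
    features = np.asarray(features, dtype=float)
    labels = np.asarray(labels)
    buckets = [[], [], []]
    for cls in np.unique(labels):
        idx = np.flatnonzero(labels == cls)
        parts = split_by_density(features[idx], 3, cutoff_percentile)
        for bucket, part in zip(buckets, parts):
            bucket.append(idx[part])  # map class-local positions to dataset indices
    subsets = [np.concatenate(b) for b in buckets]
    return [np.sort(np.concatenate(subsets[: k + 1])) for k in range(3)]
```

A training loop would then iterate over the returned stages in order, fitting the network on the indices of each stage before moving to the next. Samples with few close neighbors in their own class, which are the most likely to be mislabeled, only enter in the final stage.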

Similar Items