A Novel Double Ensemble Algorithm for the Classification of Multi-Class Imbalanced Hyperspectral Data

The class imbalance problem has been reported to exist in remote sensing and hinders the classification performance of many machine learning algorithms. Several technologies, such as data sampling methods, feature selection-based methods, and ensemble-based methods, have been proposed to solve the c...

Full description

Bibliographic Details
Main Authors: Daying Quan, Wei Feng, Gabriel Dauphin, Xiaofeng Wang, Wenjiang Huang, Mengdao Xing
Format: Article
Language:English
Published: MDPI AG 2022-08-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/14/15/3765
_version_ 1797432733716185088
author Daying Quan
Wei Feng
Gabriel Dauphin
Xiaofeng Wang
Wenjiang Huang
Mengdao Xing
author_facet Daying Quan
Wei Feng
Gabriel Dauphin
Xiaofeng Wang
Wenjiang Huang
Mengdao Xing
author_sort Daying Quan
collection DOAJ
description The class imbalance problem has been reported to exist in remote sensing and hinders the classification performance of many machine learning algorithms. Several technologies, such as data sampling methods, feature selection-based methods, and ensemble-based methods, have been proposed to solve the class imbalance problem. However, these methods suffer from the loss of useful information or from artificial noise, or result in overfitting. A novel double ensemble algorithm is proposed to deal with the multi-class imbalance problem of the hyperspectral image in this paper. This method first computes the feature importance values of the hyperspectral data via an ensemble model, then produces several balanced data sets based on oversampling and builds a number of classifiers. Finally, the classification results of these diversity classifiers are combined according to a specific ensemble rule. In the experiment, different data-handling methods and classification methods including random undersampling (RUS), random oversampling (ROS), Adaboost, Bagging, and random forest are compared with the proposed double random forest method. The experimental results on three imbalanced hyperspectral data sets demonstrate the effectiveness of the proposed algorithm.
first_indexed 2024-03-09T10:05:54Z
format Article
id doaj.art-dd7c0752a057404992c4ccbcb9a0585b
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-03-09T10:05:54Z
publishDate 2022-08-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-dd7c0752a057404992c4ccbcb9a0585b2023-12-01T23:08:42ZengMDPI AGRemote Sensing2072-42922022-08-011415376510.3390/rs14153765A Novel Double Ensemble Algorithm for the Classification of Multi-Class Imbalanced Hyperspectral DataDaying Quan0Wei Feng1Gabriel Dauphin2Xiaofeng Wang3Wenjiang Huang4Mengdao Xing5Key Laboratory of Electromagnetic Wave Information Technology and Metrology of Zhejiang Province, College of Information Engineering, China Jiliang University, Hangzhou 310018, ChinaHangzhou Institute of Technology, Xidian University, Hangzhou 311200, ChinaLaboratory of Information Processing and Transmission, L2TI, Institut Galilée, University Paris XIII, 93430 Villetaneuse, FranceKey Laboratory of Electromagnetic Wave Information Technology and Metrology of Zhejiang Province, College of Information Engineering, China Jiliang University, Hangzhou 310018, ChinaKey Laboratory of Digital Earth Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, ChinaHangzhou Institute of Technology, Xidian University, Hangzhou 311200, ChinaThe class imbalance problem has been reported to exist in remote sensing and hinders the classification performance of many machine learning algorithms. Several technologies, such as data sampling methods, feature selection-based methods, and ensemble-based methods, have been proposed to solve the class imbalance problem. However, these methods suffer from the loss of useful information or from artificial noise, or result in overfitting. A novel double ensemble algorithm is proposed to deal with the multi-class imbalance problem of the hyperspectral image in this paper. This method first computes the feature importance values of the hyperspectral data via an ensemble model, then produces several balanced data sets based on oversampling and builds a number of classifiers. Finally, the classification results of these diversity classifiers are combined according to a specific ensemble rule. In the experiment, different data-handling methods and classification methods including random undersampling (RUS), random oversampling (ROS), Adaboost, Bagging, and random forest are compared with the proposed double random forest method. The experimental results on three imbalanced hyperspectral data sets demonstrate the effectiveness of the proposed algorithm.https://www.mdpi.com/2072-4292/14/15/3765classificationremote sensinghyperspectral imageimbalance learningdata sampling
spellingShingle Daying Quan
Wei Feng
Gabriel Dauphin
Xiaofeng Wang
Wenjiang Huang
Mengdao Xing
A Novel Double Ensemble Algorithm for the Classification of Multi-Class Imbalanced Hyperspectral Data
Remote Sensing
classification
remote sensing
hyperspectral image
imbalance learning
data sampling
title A Novel Double Ensemble Algorithm for the Classification of Multi-Class Imbalanced Hyperspectral Data
title_full A Novel Double Ensemble Algorithm for the Classification of Multi-Class Imbalanced Hyperspectral Data
title_fullStr A Novel Double Ensemble Algorithm for the Classification of Multi-Class Imbalanced Hyperspectral Data
title_full_unstemmed A Novel Double Ensemble Algorithm for the Classification of Multi-Class Imbalanced Hyperspectral Data
title_short A Novel Double Ensemble Algorithm for the Classification of Multi-Class Imbalanced Hyperspectral Data
title_sort novel double ensemble algorithm for the classification of multi class imbalanced hyperspectral data
topic classification
remote sensing
hyperspectral image
imbalance learning
data sampling
url https://www.mdpi.com/2072-4292/14/15/3765
work_keys_str_mv AT dayingquan anoveldoubleensemblealgorithmfortheclassificationofmulticlassimbalancedhyperspectraldata
AT weifeng anoveldoubleensemblealgorithmfortheclassificationofmulticlassimbalancedhyperspectraldata
AT gabrieldauphin anoveldoubleensemblealgorithmfortheclassificationofmulticlassimbalancedhyperspectraldata
AT xiaofengwang anoveldoubleensemblealgorithmfortheclassificationofmulticlassimbalancedhyperspectraldata
AT wenjianghuang anoveldoubleensemblealgorithmfortheclassificationofmulticlassimbalancedhyperspectraldata
AT mengdaoxing anoveldoubleensemblealgorithmfortheclassificationofmulticlassimbalancedhyperspectraldata
AT dayingquan noveldoubleensemblealgorithmfortheclassificationofmulticlassimbalancedhyperspectraldata
AT weifeng noveldoubleensemblealgorithmfortheclassificationofmulticlassimbalancedhyperspectraldata
AT gabrieldauphin noveldoubleensemblealgorithmfortheclassificationofmulticlassimbalancedhyperspectraldata
AT xiaofengwang noveldoubleensemblealgorithmfortheclassificationofmulticlassimbalancedhyperspectraldata
AT wenjianghuang noveldoubleensemblealgorithmfortheclassificationofmulticlassimbalancedhyperspectraldata
AT mengdaoxing noveldoubleensemblealgorithmfortheclassificationofmulticlassimbalancedhyperspectraldata