Potential of Ensemble Learning to Improve Tree-Based Classifiers for Landslide Susceptibility Mapping

Ensemble learning methods have been widely used due to their remarkable generalized performance, but their potential in landslide spatial prediction application is not fully studied. To take full advantage of ensemble learning techniques, the classification and regression tree classifier and four tr...


Bibliographic Details
Main Authors: Jiahui Song, Yi Wang, Zhice Fang, Ling Peng, Haoyuan Hong
Format: Article
Language: English
Published: IEEE 2020-01-01
Series: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
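Subjects: Embedded feature selection; ensemble learning; landslides susceptibility mapping; tree-based classifiers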
Online Access: https://ieeexplore.ieee.org/document/9157927/
author Jiahui Song
Yi Wang
Zhice Fang
Ling Peng
Haoyuan Hong
collection DOAJ
description Ensemble learning methods have been widely used due to their remarkable generalized performance, but their potential in landslide spatial prediction application is not fully studied. To take full advantage of ensemble learning techniques, the classification and regression tree classifier and four tree-based ensemble classifiers of random forest, extremely randomized tree, gradient boosting decision trees, and extreme gradient boosting decision trees are used in this study for landslide susceptibility assessment. Specifically, a stacking ensemble learning framework coupled with embedded feature selection is presented, consisting of the tree-based classifiers mentioned previously as base learners and logistic regression as a metalearner in a two-layer structure. In the study area of Yongxin, China, 364 historical landslide locations were first randomly partitioned at a ratio of 7/3 for training and testing the model. Then, a spatial database of 16 landslide causative factors was constructed for landslide prediction. Meanwhile, the relative importance of these factors was quantified by using the total number of feature splits and the average Gini index during the training process, and a novel embedded feature selection method was used in the base learners of the proposed framework to further improve the computational efficiency and predictive performance by allowing each base learner to obtain its own optimal subfeature space. Finally, the different methods were assessed by using several evaluation criteria. Experimental results demonstrated that the proposed ensemble learning framework had the highest area under the curve value of 0.864, and that it is more effective than the conventional tree-based classifiers and other ensemble learning methods.
first_indexed 2024-12-14T17:40:24Z
format Article
id doaj.art-d90909fc663e47849f11ab37a18ac7b6
institution Directory Open Access Journal
issn 2151-1535
language English
last_indexed 2024-12-14T17:40:24Z
publishDate 2020-01-01
publisher IEEE
record_format Article
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
spelling doaj.art-d90909fc663e47849f11ab37a18ac7b6
2022-12-21T22:52:52Z
eng
IEEE
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
2151-1535
2020-01-01
vol. 13, pp. 4642-4662
10.1109/JSTARS.2020.3014143
9157927
Jiahui Song, Institute of Geophysics and Geomatics, China University of Geosciences, Wuhan, China
Yi Wang, https://orcid.org/0000-0002-1347-7030, Institute of Geophysics and Geomatics, China University of Geosciences, Wuhan, China
Zhice Fang, https://orcid.org/0000-0003-4414-8712, Institute of Geophysics and Geomatics, China University of Geosciences, Wuhan, China
Ling Peng, China Institute of Geo-Environment Monitoring, Beijing, China
Haoyuan Hong, https://orcid.org/0000-0001-6224-069X, Department of Geography and Regional Research, University of Vienna, Vienna, Austria
title Potential of Ensemble Learning to Improve Tree-Based Classifiers for Landslide Susceptibility Mapping
topic Embedded feature selection
ensemble learning
landslides susceptibility mapping
tree-based classifiers
url https://ieeexplore.ieee.org/document/9157927/
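
The two-layer stacking framework summarized in the description can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes scikit-learn, substitutes synthetic data from `make_classification` for the Yongxin landslide inventory, omits the extreme gradient boosting base learner (it is provided by a separate library), and uses `SelectFromModel` with a median importance threshold as a stand-in for the paper's embedded feature selection. The helper name `with_embedded_selection`, the sample count, and all hyperparameters are hypothetical.

```python
from sklearn.base import clone
from sklearn.datasets import make_classification
from sklearn.ensemble import (
    ExtraTreesClassifier,
    GradientBoostingClassifier,
    RandomForestClassifier,
    StackingClassifier,
)
from sklearn.feature_selection import SelectFromModel
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.tree import DecisionTreeClassifier

SEED = 0

# Synthetic stand-in for the spatial database: 16 columns mirror the
# 16 causative factors; the sample count is an arbitrary assumption.
X, y = make_classification(
    n_samples=600, n_features=16, n_informative=6, random_state=SEED
)

# 7/3 random partition into training and testing sets
# (stratification is an added assumption, not stated in the record).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=SEED
)


def with_embedded_selection(model):
    """Hypothetical helper: give one base learner its own feature subset.

    A copy of the model is fitted first; features whose impurity-based
    importance is at least the median are kept, and the model itself is
    then trained on that subset only.
    """
    selector = SelectFromModel(clone(model), threshold="median")
    return make_pipeline(selector, model)


# Layer 1: tree-based base learners, each with its own selected features.
base_learners = [
    ("cart", with_embedded_selection(DecisionTreeClassifier(random_state=SEED))),
    ("rf", with_embedded_selection(RandomForestClassifier(random_state=SEED))),
    ("et", with_embedded_selection(ExtraTreesClassifier(random_state=SEED))),
    ("gbdt", with_embedded_selection(GradientBoostingClassifier(random_state=SEED))),
]

# Layer 2: logistic regression metalearner trained on the out-of-fold
# class probabilities produced by the base learners.
stack = StackingClassifier(
    estimators=base_learners,
    final_estimator=LogisticRegression(),
    stack_method="predict_proba",
    cv=5,
)
stack.fit(X_train, y_train)

# Evaluate with the area under the ROC curve on the held-out 30%.
auc = roc_auc_score(y_test, stack.predict_proba(X_test)[:, 1])
print(f"stacking AUC on synthetic data: {auc:.3f}")
```

The AUC printed here reflects only the synthetic data and is not comparable to the 0.864 reported in the description. Wrapping each base learner in its own pipeline is what lets every learner settle on a different feature subset before its predictions reach the metalearner.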