Potential of Ensemble Learning to Improve Tree-Based Classifiers for Landslide Susceptibility Mapping
Ensemble learning methods have been widely used due to their remarkable generalized performance, but their potential in landslide spatial prediction application is not fully studied. To take full advantage of ensemble learning techniques, the classification and regression tree classifier and four tr...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9157927/ |
_version_ | 1818438428198436864 |
---|---|
author | Jiahui Song Yi Wang Zhice Fang Ling Peng Haoyuan Hong |
author_facet | Jiahui Song Yi Wang Zhice Fang Ling Peng Haoyuan Hong |
author_sort | Jiahui Song |
collection | DOAJ |
description | Ensemble learning methods have been widely used due to their remarkable generalized performance, but their potential in landslide spatial prediction application is not fully studied. To take full advantage of ensemble learning techniques, the classification and regression tree classifier and four tree-based ensemble classifiers of random forest, extremely randomized tree, gradient boosting decision trees, and extreme gradient boosting decision trees are used in this study for landslide susceptibility assessment. Specifically, a stacking ensemble learning framework coupled with embedded feature selection is presented, consisting of multiple tree-based classifiers mentioned previously as base learners and logistic regression as a metalearner in a two-layer structure. In the study area of Yongxin, China, 364 historical landslide locations were first randomly partitioned into a ratio of 7/3 for training and testing the model. Then, a spatial database of 16 landslide causative factors was constructed for landslide prediction. Meanwhile, the relative importance of these factors were quantified by using the total number of feature splits and the average Gini index during the training process, and a novel embedded feature selection method was used in the base learner of the proposed framework to further improve the computational efficiency and predictive performance by allowing each base learner to obtain its own optimal subfeature space. Finally, different methods were assessed by using several evaluation criteria. Experimental results demonstrated that the proposed ensemble learning framework had the highest area under the curve value of 0.864, and it is more effective than the conventional tree-based classifiers and other ensemble learning methods. |
first_indexed | 2024-12-14T17:40:24Z |
format | Article |
id | doaj.art-d90909fc663e47849f11ab37a18ac7b6 |
institution | Directory Open Access Journal |
issn | 2151-1535 |
language | English |
last_indexed | 2024-12-14T17:40:24Z |
publishDate | 2020-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
spelling | doaj.art-d90909fc663e47849f11ab37a18ac7b62022-12-21T22:52:52ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352020-01-01134642466210.1109/JSTARS.2020.30141439157927Potential of Ensemble Learning to Improve Tree-Based Classifiers for Landslide Susceptibility MappingJiahui Song0Yi Wang1https://orcid.org/0000-0002-1347-7030Zhice Fang2https://orcid.org/0000-0003-4414-8712Ling Peng3Haoyuan Hong4https://orcid.org/0000-0001-6224-069XInstitute of Geophysics and Geomatics, China University of Geosciences, Wuhan, ChinaInstitute of Geophysics and Geomatics, China University of Geosciences, Wuhan, ChinaInstitute of Geophysics and Geomatics, China University of Geosciences, Wuhan, ChinaChina Institute of Geo-Environment Monitoring, Beijing, ChinaDepartment of Geography and Regional Research, University of Vienna, Vienna, AustriaEnsemble learning methods have been widely used due to their remarkable generalized performance, but their potential in landslide spatial prediction application is not fully studied. To take full advantage of ensemble learning techniques, the classification and regression tree classifier and four tree-based ensemble classifiers of random forest, extremely randomized tree, gradient boosting decision trees, and extreme gradient boosting decision trees are used in this study for landslide susceptibility assessment. Specifically, a stacking ensemble learning framework coupled with embedded feature selection is presented, consisting of multiple tree-based classifiers mentioned previously as base learners and logistic regression as a metalearner in a two-layer structure. In the study area of Yongxin, China, 364 historical landslide locations were first randomly partitioned into a ratio of 7/3 for training and testing the model. Then, a spatial database of 16 landslide causative factors was constructed for landslide prediction. Meanwhile, the relative importance of these factors were quantified by using the total number of feature splits and the average Gini index during the training process, and a novel embedded feature selection method was used in the base learner of the proposed framework to further improve the computational efficiency and predictive performance by allowing each base learner to obtain its own optimal subfeature space. Finally, different methods were assessed by using several evaluation criteria. Experimental results demonstrated that the proposed ensemble learning framework had the highest area under the curve value of 0.864, and it is more effective than the conventional tree-based classifiers and other ensemble learning methods.https://ieeexplore.ieee.org/document/9157927/Embedded feature selectionensemble learninglandslides susceptibility mappingtree-based classifiers |
spellingShingle | Jiahui Song Yi Wang Zhice Fang Ling Peng Haoyuan Hong Potential of Ensemble Learning to Improve Tree-Based Classifiers for Landslide Susceptibility Mapping IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Embedded feature selection ensemble learning landslides susceptibility mapping tree-based classifiers |
title | Potential of Ensemble Learning to Improve Tree-Based Classifiers for Landslide Susceptibility Mapping |
title_full | Potential of Ensemble Learning to Improve Tree-Based Classifiers for Landslide Susceptibility Mapping |
title_fullStr | Potential of Ensemble Learning to Improve Tree-Based Classifiers for Landslide Susceptibility Mapping |
title_full_unstemmed | Potential of Ensemble Learning to Improve Tree-Based Classifiers for Landslide Susceptibility Mapping |
title_short | Potential of Ensemble Learning to Improve Tree-Based Classifiers for Landslide Susceptibility Mapping |
title_sort | potential of ensemble learning to improve tree based classifiers for landslide susceptibility mapping |
topic | Embedded feature selection ensemble learning landslides susceptibility mapping tree-based classifiers |
url | https://ieeexplore.ieee.org/document/9157927/ |
work_keys_str_mv | AT jiahuisong potentialofensemblelearningtoimprovetreebasedclassifiersforlandslidesusceptibilitymapping AT yiwang potentialofensemblelearningtoimprovetreebasedclassifiersforlandslidesusceptibilitymapping AT zhicefang potentialofensemblelearningtoimprovetreebasedclassifiersforlandslidesusceptibilitymapping AT lingpeng potentialofensemblelearningtoimprovetreebasedclassifiersforlandslidesusceptibilitymapping AT haoyuanhong potentialofensemblelearningtoimprovetreebasedclassifiersforlandslidesusceptibilitymapping |