Collapse susceptibility evaluation based on an improved two-step sampling strategy and a convolutional neural network

Objective Machine learning has been widely applied in the fields of collapse, landslide and debris flow susceptibility analysis. The selection of nonhazard samples is a key issue in landslide susceptibility analysis. Traditional random sampling and manual labelling methods may involve randomness and...

Full description

Bibliographic Details
Main Authors: Rilang DENG, Qinghua ZHANG, Wei LIU, Lingwei CHEN, Jianhui TAN, Zemao GAO, Xianchang ZHENG
Format: Article
Language:zho
Published: Editorial Department of Bulletin of Geological Science and Technology 2024-03-01
Series:地质科技通报
Subjects:
Online Access:https://dzkjqb.cug.edu.cn/en/article/doi/10.19509/j.cnki.dzkq.tb20220535
_version_ 1797200604472279040
author Rilang DENG
Qinghua ZHANG
Wei LIU
Lingwei CHEN
Jianhui TAN
Zemao GAO
Xianchang ZHENG
author_facet Rilang DENG
Qinghua ZHANG
Wei LIU
Lingwei CHEN
Jianhui TAN
Zemao GAO
Xianchang ZHENG
author_sort Rilang DENG
collection DOAJ
description Objective Machine learning has been widely applied in the fields of collapse, landslide and debris flow susceptibility analysis. The selection of nonhazard samples is a key issue in landslide susceptibility analysis. Traditional random sampling and manual labelling methods may involve randomness and subjectivity. Methods In view of the potential randomness and representativeness of noncollapse samples, this paper considered soil collapse susceptibility evaluation a positive-unlabelled (PU) learning problem and proposes a two-step convolutional neural network framework (ISpy-CNN) that combines an information value model and the Spy technique. First, 15 collapse-related factors were selected for modelling based on the geomorphological, geological, hydrological, and artificial environmental conditions of the study area. Low-information-value samples that were able to map the distribution structure of noncollapsing samples were screened by the information value model. Then, through the Spy technique and training the CNN model, negative samples with high confidence were identified from low-information-value samples that were classified as noncollapsed samples. Finally, based on the framework and traditional random sampling, we used support vector machine (SVM) and random forest (RF) models to compare and verify the reliability, prediction accuracy and data sensitivity of the proposed learning framework and other models. Results The results illustrate that the proposed ISpy-CNN method can improve the accuracy, F1 value, sensitivity and specificity on the validation set by 6.82%, 6.82%, 6.82%, 8.23%, respectively compared to random sampling and 2.86%, 2.89%, 2.86%, 2.31%, respectively compared to the traditional Spy technique. The prediction accuracy of step 2 in PU learning using the CNN model is higher than that of the RF and SVM models. The sample set screened by the ISpy-CNN framework exhibited greater stability, prediction accuracy and growth rate than those screened by the traditional Spy technique by adding the same number of training samples. Conclusion The ISpy-CNN framework proposed in this paper can better assist in the selection of nonhazard samples and real collapse spatial distribution maps, and the results of the framework are more consistent with the actual collapse distributions.
first_indexed 2024-04-24T07:34:17Z
format Article
id doaj.art-a6e34f6c8a8446e59bda874ab09e4937
institution Directory Open Access Journal
issn 2096-8523
language zho
last_indexed 2024-04-24T07:34:17Z
publishDate 2024-03-01
publisher Editorial Department of Bulletin of Geological Science and Technology
record_format Article
series 地质科技通报
spelling doaj.art-a6e34f6c8a8446e59bda874ab09e49372024-04-20T10:18:33ZzhoEditorial Department of Bulletin of Geological Science and Technology地质科技通报2096-85232024-03-0143218620010.19509/j.cnki.dzkq.tb20220535dzkjtb-43-2-186Collapse susceptibility evaluation based on an improved two-step sampling strategy and a convolutional neural networkRilang DENG0Qinghua ZHANG1Wei LIU2Lingwei CHEN3Jianhui TAN4Zemao GAO5Xianchang ZHENG6School of Civil Engineering, Guangzhou University, Guangzhou 510006, ChinaGuangzhou Urban Planning & Design Survey Research Institute, Guangzhou 510060, ChinaGuangzhou Urban Planning & Design Survey Research Institute, Guangzhou 510060, ChinaGuangzhou Urban Planning & Design Survey Research Institute, Guangzhou 510060, ChinaGuangzhou Urban Planning & Design Survey Research Institute, Guangzhou 510060, ChinaSchool of Civil Engineering, Guangzhou University, Guangzhou 510006, ChinaSchool of Civil Engineering, Guangzhou University, Guangzhou 510006, ChinaObjective Machine learning has been widely applied in the fields of collapse, landslide and debris flow susceptibility analysis. The selection of nonhazard samples is a key issue in landslide susceptibility analysis. Traditional random sampling and manual labelling methods may involve randomness and subjectivity. Methods In view of the potential randomness and representativeness of noncollapse samples, this paper considered soil collapse susceptibility evaluation a positive-unlabelled (PU) learning problem and proposes a two-step convolutional neural network framework (ISpy-CNN) that combines an information value model and the Spy technique. First, 15 collapse-related factors were selected for modelling based on the geomorphological, geological, hydrological, and artificial environmental conditions of the study area. Low-information-value samples that were able to map the distribution structure of noncollapsing samples were screened by the information value model. Then, through the Spy technique and training the CNN model, negative samples with high confidence were identified from low-information-value samples that were classified as noncollapsed samples. Finally, based on the framework and traditional random sampling, we used support vector machine (SVM) and random forest (RF) models to compare and verify the reliability, prediction accuracy and data sensitivity of the proposed learning framework and other models. Results The results illustrate that the proposed ISpy-CNN method can improve the accuracy, F1 value, sensitivity and specificity on the validation set by 6.82%, 6.82%, 6.82%, 8.23%, respectively compared to random sampling and 2.86%, 2.89%, 2.86%, 2.31%, respectively compared to the traditional Spy technique. The prediction accuracy of step 2 in PU learning using the CNN model is higher than that of the RF and SVM models. The sample set screened by the ISpy-CNN framework exhibited greater stability, prediction accuracy and growth rate than those screened by the traditional Spy technique by adding the same number of training samples. Conclusion The ISpy-CNN framework proposed in this paper can better assist in the selection of nonhazard samples and real collapse spatial distribution maps, and the results of the framework are more consistent with the actual collapse distributions.https://dzkjqb.cug.edu.cn/en/article/doi/10.19509/j.cnki.dzkq.tb20220535collapsesusceptibility evaluationpositive and unlabeled (pu) learningspy techniqueinformation valueconvolutional neural networkrandom forestsupport vector machine
spellingShingle Rilang DENG
Qinghua ZHANG
Wei LIU
Lingwei CHEN
Jianhui TAN
Zemao GAO
Xianchang ZHENG
Collapse susceptibility evaluation based on an improved two-step sampling strategy and a convolutional neural network
地质科技通报
collapse
susceptibility evaluation
positive and unlabeled (pu) learning
spy technique
information value
convolutional neural network
random forest
support vector machine
title Collapse susceptibility evaluation based on an improved two-step sampling strategy and a convolutional neural network
title_full Collapse susceptibility evaluation based on an improved two-step sampling strategy and a convolutional neural network
title_fullStr Collapse susceptibility evaluation based on an improved two-step sampling strategy and a convolutional neural network
title_full_unstemmed Collapse susceptibility evaluation based on an improved two-step sampling strategy and a convolutional neural network
title_short Collapse susceptibility evaluation based on an improved two-step sampling strategy and a convolutional neural network
title_sort collapse susceptibility evaluation based on an improved two step sampling strategy and a convolutional neural network
topic collapse
susceptibility evaluation
positive and unlabeled (pu) learning
spy technique
information value
convolutional neural network
random forest
support vector machine
url https://dzkjqb.cug.edu.cn/en/article/doi/10.19509/j.cnki.dzkq.tb20220535
work_keys_str_mv AT rilangdeng collapsesusceptibilityevaluationbasedonanimprovedtwostepsamplingstrategyandaconvolutionalneuralnetwork
AT qinghuazhang collapsesusceptibilityevaluationbasedonanimprovedtwostepsamplingstrategyandaconvolutionalneuralnetwork
AT weiliu collapsesusceptibilityevaluationbasedonanimprovedtwostepsamplingstrategyandaconvolutionalneuralnetwork
AT lingweichen collapsesusceptibilityevaluationbasedonanimprovedtwostepsamplingstrategyandaconvolutionalneuralnetwork
AT jianhuitan collapsesusceptibilityevaluationbasedonanimprovedtwostepsamplingstrategyandaconvolutionalneuralnetwork
AT zemaogao collapsesusceptibilityevaluationbasedonanimprovedtwostepsamplingstrategyandaconvolutionalneuralnetwork
AT xianchangzheng collapsesusceptibilityevaluationbasedonanimprovedtwostepsamplingstrategyandaconvolutionalneuralnetwork