Collapse susceptibility evaluation based on an improved two-step sampling strategy and a convolutional neural network
Objective Machine learning has been widely applied in the fields of collapse, landslide and debris flow susceptibility analysis. The selection of nonhazard samples is a key issue in landslide susceptibility analysis. Traditional random sampling and manual labelling methods may involve randomness and...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | zho |
Published: |
Editorial Department of Bulletin of Geological Science and Technology
2024-03-01
|
Series: | 地质科技通报 |
Subjects: | |
Online Access: | https://dzkjqb.cug.edu.cn/en/article/doi/10.19509/j.cnki.dzkq.tb20220535 |
_version_ | 1797200604472279040 |
---|---|
author | Rilang DENG Qinghua ZHANG Wei LIU Lingwei CHEN Jianhui TAN Zemao GAO Xianchang ZHENG |
author_facet | Rilang DENG Qinghua ZHANG Wei LIU Lingwei CHEN Jianhui TAN Zemao GAO Xianchang ZHENG |
author_sort | Rilang DENG |
collection | DOAJ |
description | Objective Machine learning has been widely applied in the fields of collapse, landslide and debris flow susceptibility analysis. The selection of nonhazard samples is a key issue in landslide susceptibility analysis. Traditional random sampling and manual labelling methods may involve randomness and subjectivity. Methods In view of the potential randomness and representativeness of noncollapse samples, this paper considered soil collapse susceptibility evaluation a positive-unlabelled (PU) learning problem and proposes a two-step convolutional neural network framework (ISpy-CNN) that combines an information value model and the Spy technique. First, 15 collapse-related factors were selected for modelling based on the geomorphological, geological, hydrological, and artificial environmental conditions of the study area. Low-information-value samples that were able to map the distribution structure of noncollapsing samples were screened by the information value model. Then, through the Spy technique and training the CNN model, negative samples with high confidence were identified from low-information-value samples that were classified as noncollapsed samples. Finally, based on the framework and traditional random sampling, we used support vector machine (SVM) and random forest (RF) models to compare and verify the reliability, prediction accuracy and data sensitivity of the proposed learning framework and other models. Results The results illustrate that the proposed ISpy-CNN method can improve the accuracy, F1 value, sensitivity and specificity on the validation set by 6.82%, 6.82%, 6.82%, 8.23%, respectively compared to random sampling and 2.86%, 2.89%, 2.86%, 2.31%, respectively compared to the traditional Spy technique. The prediction accuracy of step 2 in PU learning using the CNN model is higher than that of the RF and SVM models. The sample set screened by the ISpy-CNN framework exhibited greater stability, prediction accuracy and growth rate than those screened by the traditional Spy technique by adding the same number of training samples. Conclusion The ISpy-CNN framework proposed in this paper can better assist in the selection of nonhazard samples and real collapse spatial distribution maps, and the results of the framework are more consistent with the actual collapse distributions. |
first_indexed | 2024-04-24T07:34:17Z |
format | Article |
id | doaj.art-a6e34f6c8a8446e59bda874ab09e4937 |
institution | Directory Open Access Journal |
issn | 2096-8523 |
language | zho |
last_indexed | 2024-04-24T07:34:17Z |
publishDate | 2024-03-01 |
publisher | Editorial Department of Bulletin of Geological Science and Technology |
record_format | Article |
series | 地质科技通报 |
spelling | doaj.art-a6e34f6c8a8446e59bda874ab09e49372024-04-20T10:18:33ZzhoEditorial Department of Bulletin of Geological Science and Technology地质科技通报2096-85232024-03-0143218620010.19509/j.cnki.dzkq.tb20220535dzkjtb-43-2-186Collapse susceptibility evaluation based on an improved two-step sampling strategy and a convolutional neural networkRilang DENG0Qinghua ZHANG1Wei LIU2Lingwei CHEN3Jianhui TAN4Zemao GAO5Xianchang ZHENG6School of Civil Engineering, Guangzhou University, Guangzhou 510006, ChinaGuangzhou Urban Planning & Design Survey Research Institute, Guangzhou 510060, ChinaGuangzhou Urban Planning & Design Survey Research Institute, Guangzhou 510060, ChinaGuangzhou Urban Planning & Design Survey Research Institute, Guangzhou 510060, ChinaGuangzhou Urban Planning & Design Survey Research Institute, Guangzhou 510060, ChinaSchool of Civil Engineering, Guangzhou University, Guangzhou 510006, ChinaSchool of Civil Engineering, Guangzhou University, Guangzhou 510006, ChinaObjective Machine learning has been widely applied in the fields of collapse, landslide and debris flow susceptibility analysis. The selection of nonhazard samples is a key issue in landslide susceptibility analysis. Traditional random sampling and manual labelling methods may involve randomness and subjectivity. Methods In view of the potential randomness and representativeness of noncollapse samples, this paper considered soil collapse susceptibility evaluation a positive-unlabelled (PU) learning problem and proposes a two-step convolutional neural network framework (ISpy-CNN) that combines an information value model and the Spy technique. First, 15 collapse-related factors were selected for modelling based on the geomorphological, geological, hydrological, and artificial environmental conditions of the study area. Low-information-value samples that were able to map the distribution structure of noncollapsing samples were screened by the information value model. Then, through the Spy technique and training the CNN model, negative samples with high confidence were identified from low-information-value samples that were classified as noncollapsed samples. Finally, based on the framework and traditional random sampling, we used support vector machine (SVM) and random forest (RF) models to compare and verify the reliability, prediction accuracy and data sensitivity of the proposed learning framework and other models. Results The results illustrate that the proposed ISpy-CNN method can improve the accuracy, F1 value, sensitivity and specificity on the validation set by 6.82%, 6.82%, 6.82%, 8.23%, respectively compared to random sampling and 2.86%, 2.89%, 2.86%, 2.31%, respectively compared to the traditional Spy technique. The prediction accuracy of step 2 in PU learning using the CNN model is higher than that of the RF and SVM models. The sample set screened by the ISpy-CNN framework exhibited greater stability, prediction accuracy and growth rate than those screened by the traditional Spy technique by adding the same number of training samples. Conclusion The ISpy-CNN framework proposed in this paper can better assist in the selection of nonhazard samples and real collapse spatial distribution maps, and the results of the framework are more consistent with the actual collapse distributions.https://dzkjqb.cug.edu.cn/en/article/doi/10.19509/j.cnki.dzkq.tb20220535collapsesusceptibility evaluationpositive and unlabeled (pu) learningspy techniqueinformation valueconvolutional neural networkrandom forestsupport vector machine |
spellingShingle | Rilang DENG Qinghua ZHANG Wei LIU Lingwei CHEN Jianhui TAN Zemao GAO Xianchang ZHENG Collapse susceptibility evaluation based on an improved two-step sampling strategy and a convolutional neural network 地质科技通报 collapse susceptibility evaluation positive and unlabeled (pu) learning spy technique information value convolutional neural network random forest support vector machine |
title | Collapse susceptibility evaluation based on an improved two-step sampling strategy and a convolutional neural network |
title_full | Collapse susceptibility evaluation based on an improved two-step sampling strategy and a convolutional neural network |
title_fullStr | Collapse susceptibility evaluation based on an improved two-step sampling strategy and a convolutional neural network |
title_full_unstemmed | Collapse susceptibility evaluation based on an improved two-step sampling strategy and a convolutional neural network |
title_short | Collapse susceptibility evaluation based on an improved two-step sampling strategy and a convolutional neural network |
title_sort | collapse susceptibility evaluation based on an improved two step sampling strategy and a convolutional neural network |
topic | collapse susceptibility evaluation positive and unlabeled (pu) learning spy technique information value convolutional neural network random forest support vector machine |
url | https://dzkjqb.cug.edu.cn/en/article/doi/10.19509/j.cnki.dzkq.tb20220535 |
work_keys_str_mv | AT rilangdeng collapsesusceptibilityevaluationbasedonanimprovedtwostepsamplingstrategyandaconvolutionalneuralnetwork AT qinghuazhang collapsesusceptibilityevaluationbasedonanimprovedtwostepsamplingstrategyandaconvolutionalneuralnetwork AT weiliu collapsesusceptibilityevaluationbasedonanimprovedtwostepsamplingstrategyandaconvolutionalneuralnetwork AT lingweichen collapsesusceptibilityevaluationbasedonanimprovedtwostepsamplingstrategyandaconvolutionalneuralnetwork AT jianhuitan collapsesusceptibilityevaluationbasedonanimprovedtwostepsamplingstrategyandaconvolutionalneuralnetwork AT zemaogao collapsesusceptibilityevaluationbasedonanimprovedtwostepsamplingstrategyandaconvolutionalneuralnetwork AT xianchangzheng collapsesusceptibilityevaluationbasedonanimprovedtwostepsamplingstrategyandaconvolutionalneuralnetwork |