Interpretable Machine Learning Methods for Landslide Analysis

Landslides have the power to alter terrain, reshape ecosystems, damage anthropogenic structures, and destroy entire communities. On March 31, 2017, Mocoa fell victim to a devastating landslide that claimed 254 lives and left hundreds more missing. Given the region’s high frequency of landslides, con...

Full description

Bibliographic Details
Main Author:	Gupta, Deepankar
Other Authors:	Fernandez, John E.
Format:	Thesis
Published:	Massachusetts Institute of Technology 2022
Online Access:	https://hdl.handle.net/1721.1/143311

_version_	1826211763070173184
author	Gupta, Deepankar
author2	Fernandez, John E.
author_facet	Fernandez, John E. Gupta, Deepankar
author_sort	Gupta, Deepankar
collection	MIT
description	Landslides have the power to alter terrain, reshape ecosystems, damage anthropogenic structures, and destroy entire communities. On March 31, 2017, Mocoa fell victim to a devastating landslide that claimed 254 lives and left hundreds more missing. Given the region’s high frequency of landslides, conducting a formal investigation of its landslides and the factors that make the region susceptible to them is urgent. In this project, we use machine learning to computationally study landslide detection, the likelihood of past landslides occurrence, and landslide susceptibility, or risk, in the Mocoa region. The region’s geographical and climate features make it elusive to remote-sensing technologies and difficult to survey. We combat the resulting data sparsity by carefully designing learning tasks and data pipelines for detection and susceptibility. We meaningfully extract 20 features with scientific or computational basis. We then provide a comprehensive evaluation of four different machine learning models: logistic regression (LR), decision tree (DT), random forest (RF), and convolutional neural network (CNN) on both the landslide detection and the landslide susceptibility tasks. All four models displayed performance that was significantly better than random on both tasks. CNN models achieved the highest classification accuracy in the area of interest (AOI) for both tasks, earning 87.3% for detection and 92.5% for susceptibility.We also probed all four types of models using multiple techniques to determine features important to their decision making. Across the landslide detection models, slope, aspect, and the presence of claystones were the most consistent important features for inferring past landslide likelihood. For landslide susceptibility, we found slope, aspect, distance from fault lines, and the presence of claystones to be the most consistent important features for inferring landslide risk. The reliance of the models on slope and aspect does not surprise due to landslides involving mass movement from high to low points of elevation. Less surprising, the presence of claystones also appears to be important to both landslide tasks. The connection between claystones and Mocoan landslides merits further investigation. The importance of distance from fault lines to susceptibility models suggests that seismic activity at fault lines is a key trigger for Mocoan landslides.
first_indexed	2024-09-23T15:11:03Z
format	Thesis
id	mit-1721.1/143311
institution	Massachusetts Institute of Technology
last_indexed	2024-09-23T15:11:03Z
publishDate	2022
publisher	Massachusetts Institute of Technology
record_format	dspace
spelling	mit-1721.1/1433112022-06-16T03:10:48Z Interpretable Machine Learning Methods for Landslide Analysis Gupta, Deepankar Fernandez, John E. Freeman, William Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Landslides have the power to alter terrain, reshape ecosystems, damage anthropogenic structures, and destroy entire communities. On March 31, 2017, Mocoa fell victim to a devastating landslide that claimed 254 lives and left hundreds more missing. Given the region’s high frequency of landslides, conducting a formal investigation of its landslides and the factors that make the region susceptible to them is urgent. In this project, we use machine learning to computationally study landslide detection, the likelihood of past landslides occurrence, and landslide susceptibility, or risk, in the Mocoa region. The region’s geographical and climate features make it elusive to remote-sensing technologies and difficult to survey. We combat the resulting data sparsity by carefully designing learning tasks and data pipelines for detection and susceptibility. We meaningfully extract 20 features with scientific or computational basis. We then provide a comprehensive evaluation of four different machine learning models: logistic regression (LR), decision tree (DT), random forest (RF), and convolutional neural network (CNN) on both the landslide detection and the landslide susceptibility tasks. All four models displayed performance that was significantly better than random on both tasks. CNN models achieved the highest classification accuracy in the area of interest (AOI) for both tasks, earning 87.3% for detection and 92.5% for susceptibility.We also probed all four types of models using multiple techniques to determine features important to their decision making. Across the landslide detection models, slope, aspect, and the presence of claystones were the most consistent important features for inferring past landslide likelihood. For landslide susceptibility, we found slope, aspect, distance from fault lines, and the presence of claystones to be the most consistent important features for inferring landslide risk. The reliance of the models on slope and aspect does not surprise due to landslides involving mass movement from high to low points of elevation. Less surprising, the presence of claystones also appears to be important to both landslide tasks. The connection between claystones and Mocoan landslides merits further investigation. The importance of distance from fault lines to susceptibility models suggests that seismic activity at fault lines is a key trigger for Mocoan landslides. M.Eng. 2022-06-15T13:11:37Z 2022-06-15T13:11:37Z 2022-02 2022-02-22T18:32:05.467Z Thesis https://hdl.handle.net/1721.1/143311 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle	Gupta, Deepankar Interpretable Machine Learning Methods for Landslide Analysis
title	Interpretable Machine Learning Methods for Landslide Analysis
title_full	Interpretable Machine Learning Methods for Landslide Analysis
title_fullStr	Interpretable Machine Learning Methods for Landslide Analysis
title_full_unstemmed	Interpretable Machine Learning Methods for Landslide Analysis
title_short	Interpretable Machine Learning Methods for Landslide Analysis
title_sort	interpretable machine learning methods for landslide analysis
url	https://hdl.handle.net/1721.1/143311
work_keys_str_mv	AT guptadeepankar interpretablemachinelearningmethodsforlandslideanalysis

Interpretable Machine Learning Methods for Landslide Analysis

Similar Items