Determining Cover Management Factor with Remote Sensing and Spatial Analysis for Improving Long-Term Soil Loss Estimation in Watersheds

The universal soil loss equation (USLE) is a widely used empirical model for estimating soil loss. Among the USLE model factors, the cover management factor (C-factor) is a critical factor that substantially impacts the estimation result. Assigning C-factor values according to a land-use/land-cover...

Full description

Bibliographic Details
Main Authors: Fuan Tsai, Jhe-Syuan Lai, Kieu Anh Nguyen, Walter Chen
Format: Article
Language:English
Published: MDPI AG 2021-01-01
Series:ISPRS International Journal of Geo-Information
Subjects:
Online Access:https://www.mdpi.com/2220-9964/10/1/19
_version_ 1827604097402929152
author Fuan Tsai
Jhe-Syuan Lai
Kieu Anh Nguyen
Walter Chen
author_facet Fuan Tsai
Jhe-Syuan Lai
Kieu Anh Nguyen
Walter Chen
author_sort Fuan Tsai
collection DOAJ
description The universal soil loss equation (USLE) is a widely used empirical model for estimating soil loss. Among the USLE model factors, the cover management factor (C-factor) is a critical factor that substantially impacts the estimation result. Assigning C-factor values according to a land-use/land-cover (LULC) map from field surveys is a typical traditional approach. However, this approach may have limitations caused by the difficulty and cost in conducting field surveys and updating the LULC map regularly, thus significantly affecting the feasibility of multi-temporal analysis of soil erosion. To address this issue, this study uses data mining to build a random forest (RF) model between eight geospatial factors and the C-factor for the Shihmen Reservoir watershed in northern Taiwan for multi-temporal estimation of soil loss. The eight geospatial factors were collected or derived from remotely sensed images taken in 2004, a digital elevation model, and related digital maps. Due to the memory size limitation of the R software, only 4% of the total data points (population dataset) in each C-factor class were selected as the sample dataset (input dataset) for analysis using the stratified random sampling method. Seventy percent of the input dataset was used to train the RF model, and the other 30% was used to test the model. The results show that the RF model could capture the trend of vegetation recovery and soil loss reduction after the destructive event of Typhoon Aere in 2004 for multi-temporal analysis. Although the RF model was biased by the majority class’s large sample size (C = 0.01 class), the estimated soil erosion rate was close to the measurement obtained by the erosion pins installed in the watershed (90.6 t/ha-year). After the model’s completion, we furthered our aim to address the input dataset’s imbalanced data problem to improve the model’s classification performance. An ad-hoc down-sampling of the majority class technique was used to reduce the majority class’s sampling rate to 2%, 1%, and 0.5% while keeping the other minority classes at a 4% sample rate. The results show an improvement of the Kappa coefficient from 0.574 to 0.732, the AUC from 0.780 to 0.891, and the true positive rate of all minority classes combined from 0.43 to 0.70. However, the overall accuracy decreases from 0.952 to 0.846, and the true positive rate of the majority class declines from 0.99 to 0.94. The best average C-factor was achieved when the sampling rate of the majority class was 1%. On the other hand, the best soil erosion estimate was obtained when the sampling rate was 2%.
first_indexed 2024-03-09T05:53:10Z
format Article
id doaj.art-57a1a95193674d29948477b9d0b69ea9
institution Directory Open Access Journal
issn 2220-9964
language English
last_indexed 2024-03-09T05:53:10Z
publishDate 2021-01-01
publisher MDPI AG
record_format Article
series ISPRS International Journal of Geo-Information
spelling doaj.art-57a1a95193674d29948477b9d0b69ea92023-12-03T12:15:35ZengMDPI AGISPRS International Journal of Geo-Information2220-99642021-01-011011910.3390/ijgi10010019Determining Cover Management Factor with Remote Sensing and Spatial Analysis for Improving Long-Term Soil Loss Estimation in WatershedsFuan Tsai0Jhe-Syuan Lai1Kieu Anh Nguyen2Walter Chen3Center for Space and Remote Sensing Research, National Central University, Zhongli City, Taoyuan 320, TaiwanDepartment of Civil Engineering, Feng Chia University, Taichung 40724, TaiwanDepartment of Civil Engineering, National Taipei University of Technology, Taipei 10608, TaiwanDepartment of Civil Engineering, National Taipei University of Technology, Taipei 10608, TaiwanThe universal soil loss equation (USLE) is a widely used empirical model for estimating soil loss. Among the USLE model factors, the cover management factor (C-factor) is a critical factor that substantially impacts the estimation result. Assigning C-factor values according to a land-use/land-cover (LULC) map from field surveys is a typical traditional approach. However, this approach may have limitations caused by the difficulty and cost in conducting field surveys and updating the LULC map regularly, thus significantly affecting the feasibility of multi-temporal analysis of soil erosion. To address this issue, this study uses data mining to build a random forest (RF) model between eight geospatial factors and the C-factor for the Shihmen Reservoir watershed in northern Taiwan for multi-temporal estimation of soil loss. The eight geospatial factors were collected or derived from remotely sensed images taken in 2004, a digital elevation model, and related digital maps. Due to the memory size limitation of the R software, only 4% of the total data points (population dataset) in each C-factor class were selected as the sample dataset (input dataset) for analysis using the stratified random sampling method. Seventy percent of the input dataset was used to train the RF model, and the other 30% was used to test the model. The results show that the RF model could capture the trend of vegetation recovery and soil loss reduction after the destructive event of Typhoon Aere in 2004 for multi-temporal analysis. Although the RF model was biased by the majority class’s large sample size (C = 0.01 class), the estimated soil erosion rate was close to the measurement obtained by the erosion pins installed in the watershed (90.6 t/ha-year). After the model’s completion, we furthered our aim to address the input dataset’s imbalanced data problem to improve the model’s classification performance. An ad-hoc down-sampling of the majority class technique was used to reduce the majority class’s sampling rate to 2%, 1%, and 0.5% while keeping the other minority classes at a 4% sample rate. The results show an improvement of the Kappa coefficient from 0.574 to 0.732, the AUC from 0.780 to 0.891, and the true positive rate of all minority classes combined from 0.43 to 0.70. However, the overall accuracy decreases from 0.952 to 0.846, and the true positive rate of the majority class declines from 0.99 to 0.94. The best average C-factor was achieved when the sampling rate of the majority class was 1%. On the other hand, the best soil erosion estimate was obtained when the sampling rate was 2%.https://www.mdpi.com/2220-9964/10/1/19universal soil loss equationUSLENDVISAVImulti-temporal remote sensingspatial data mining
spellingShingle Fuan Tsai
Jhe-Syuan Lai
Kieu Anh Nguyen
Walter Chen
Determining Cover Management Factor with Remote Sensing and Spatial Analysis for Improving Long-Term Soil Loss Estimation in Watersheds
ISPRS International Journal of Geo-Information
universal soil loss equation
USLE
NDVI
SAVI
multi-temporal remote sensing
spatial data mining
title Determining Cover Management Factor with Remote Sensing and Spatial Analysis for Improving Long-Term Soil Loss Estimation in Watersheds
title_full Determining Cover Management Factor with Remote Sensing and Spatial Analysis for Improving Long-Term Soil Loss Estimation in Watersheds
title_fullStr Determining Cover Management Factor with Remote Sensing and Spatial Analysis for Improving Long-Term Soil Loss Estimation in Watersheds
title_full_unstemmed Determining Cover Management Factor with Remote Sensing and Spatial Analysis for Improving Long-Term Soil Loss Estimation in Watersheds
title_short Determining Cover Management Factor with Remote Sensing and Spatial Analysis for Improving Long-Term Soil Loss Estimation in Watersheds
title_sort determining cover management factor with remote sensing and spatial analysis for improving long term soil loss estimation in watersheds
topic universal soil loss equation
USLE
NDVI
SAVI
multi-temporal remote sensing
spatial data mining
url https://www.mdpi.com/2220-9964/10/1/19
work_keys_str_mv AT fuantsai determiningcovermanagementfactorwithremotesensingandspatialanalysisforimprovinglongtermsoillossestimationinwatersheds
AT jhesyuanlai determiningcovermanagementfactorwithremotesensingandspatialanalysisforimprovinglongtermsoillossestimationinwatersheds
AT kieuanhnguyen determiningcovermanagementfactorwithremotesensingandspatialanalysisforimprovinglongtermsoillossestimationinwatersheds
AT walterchen determiningcovermanagementfactorwithremotesensingandspatialanalysisforimprovinglongtermsoillossestimationinwatersheds