Spatial Prediction and Mapping of Soil Water Content by TPE-GBDT Model in Chinese Coastal Delta Farmland with Sentinel-2 Remote Sensing Data

Soil water content is an important indicator used to maintain the ecological balance of farmland. The efficient spatial prediction of soil water content is crucial for ensuring crop growth and food production. To this end, 104 farmland soil samples were collected in the Yellow River Delta (YRD) in C...

Full description

Bibliographic Details
Main Authors: Dexi Zhan, Yongqi Mu, Wenxu Duan, Mingzhu Ye, Yingqiang Song, Zhenqi Song, Kaizhong Yao, Dengkuo Sun, Ziqi Ding
Format: Article
Language:English
Published: MDPI AG 2023-05-01
Series:Agriculture
Subjects:
Online Access:https://www.mdpi.com/2077-0472/13/5/1088
_version_ 1827742430962647040
author Dexi Zhan
Yongqi Mu
Wenxu Duan
Mingzhu Ye
Yingqiang Song
Zhenqi Song
Kaizhong Yao
Dengkuo Sun
Ziqi Ding
author_facet Dexi Zhan
Yongqi Mu
Wenxu Duan
Mingzhu Ye
Yingqiang Song
Zhenqi Song
Kaizhong Yao
Dengkuo Sun
Ziqi Ding
author_sort Dexi Zhan
collection DOAJ
description Soil water content is an important indicator used to maintain the ecological balance of farmland. The efficient spatial prediction of soil water content is crucial for ensuring crop growth and food production. To this end, 104 farmland soil samples were collected in the Yellow River Delta (YRD) in China, and the soil water content was determined using the drying method. A gradient boosting decision tree (GBDT) model based on a tree-structured Parzen estimator (TPE) hyperparametric optimization was developed, and then the soil water content was predicted and mapped based on the soil texture and vegetation index from Sentinel-2 remote sensing images. The results of statistical analysis showed that the soil water content had a high coefficient of variation (55.30%), a non-normal distribution, and complex spatial variability. Compared with other models, the TPE-GBDT model had the highest prediction accuracy (RMSE = 6.02% and R<sup>2</sup> = 0.71), and its mapping results showed that the areas with high soil water content were distributed on both sides of the river and near the estuary. Furthermore, the results of Shapley additive explanation (SHAP) analysis showed that the soil texture (PC2 and PC5), modified normalized difference vegetation index (MNDVI), and Sentinel-2 red edge position (S2REP) index provided important contributions to the spatial prediction of soil water content. We found that the hydraulic physical properties of soil texture and the vegetation characteristics (such as vegetation coverage, root action, and transpiration) are the key factors affecting the spatial migration and heterogeneity of the soil water content in the study area. The above results show that the TPE algorithm can quickly capture the hyperparameters that are most suitable for the GBDT model, so that the GBDT model can ensure prediction accuracy, reduce the loss function with less training data, and accurately learn of the nonlinear relationship between soil water content and environmental factors. This paper proposes a machine learning method for hyperparameter optimization that shows considerable potential to predict the spatial heterogeneity of soil water content, which can effectively support regional farmland soil and water conservation and high-quality agricultural development.
first_indexed 2024-03-11T04:01:48Z
format Article
id doaj.art-a28ee71db9ce4447ae29e2abd67df657
institution Directory Open Access Journal
issn 2077-0472
language English
last_indexed 2024-03-11T04:01:48Z
publishDate 2023-05-01
publisher MDPI AG
record_format Article
series Agriculture
spelling doaj.art-a28ee71db9ce4447ae29e2abd67df6572023-11-18T00:03:57ZengMDPI AGAgriculture2077-04722023-05-01135108810.3390/agriculture13051088Spatial Prediction and Mapping of Soil Water Content by TPE-GBDT Model in Chinese Coastal Delta Farmland with Sentinel-2 Remote Sensing DataDexi Zhan0Yongqi Mu1Wenxu Duan2Mingzhu Ye3Yingqiang Song4Zhenqi Song5Kaizhong Yao6Dengkuo Sun7Ziqi Ding8School of Civil Engineering and Geomatics, Shandong University of Technology, Zibo 255000, ChinaSchool of Civil Engineering and Geomatics, Shandong University of Technology, Zibo 255000, ChinaSchool of Civil Engineering and Geomatics, Shandong University of Technology, Zibo 255000, ChinaSchool of Civil Engineering and Geomatics, Shandong University of Technology, Zibo 255000, ChinaSchool of Civil Engineering and Geomatics, Shandong University of Technology, Zibo 255000, ChinaSchool of Civil Engineering and Geomatics, Shandong University of Technology, Zibo 255000, ChinaSchool of Civil Engineering and Geomatics, Shandong University of Technology, Zibo 255000, ChinaSchool of Civil Engineering and Geomatics, Shandong University of Technology, Zibo 255000, ChinaSchool of Civil Engineering and Geomatics, Shandong University of Technology, Zibo 255000, ChinaSoil water content is an important indicator used to maintain the ecological balance of farmland. The efficient spatial prediction of soil water content is crucial for ensuring crop growth and food production. To this end, 104 farmland soil samples were collected in the Yellow River Delta (YRD) in China, and the soil water content was determined using the drying method. A gradient boosting decision tree (GBDT) model based on a tree-structured Parzen estimator (TPE) hyperparametric optimization was developed, and then the soil water content was predicted and mapped based on the soil texture and vegetation index from Sentinel-2 remote sensing images. The results of statistical analysis showed that the soil water content had a high coefficient of variation (55.30%), a non-normal distribution, and complex spatial variability. Compared with other models, the TPE-GBDT model had the highest prediction accuracy (RMSE = 6.02% and R<sup>2</sup> = 0.71), and its mapping results showed that the areas with high soil water content were distributed on both sides of the river and near the estuary. Furthermore, the results of Shapley additive explanation (SHAP) analysis showed that the soil texture (PC2 and PC5), modified normalized difference vegetation index (MNDVI), and Sentinel-2 red edge position (S2REP) index provided important contributions to the spatial prediction of soil water content. We found that the hydraulic physical properties of soil texture and the vegetation characteristics (such as vegetation coverage, root action, and transpiration) are the key factors affecting the spatial migration and heterogeneity of the soil water content in the study area. The above results show that the TPE algorithm can quickly capture the hyperparameters that are most suitable for the GBDT model, so that the GBDT model can ensure prediction accuracy, reduce the loss function with less training data, and accurately learn of the nonlinear relationship between soil water content and environmental factors. This paper proposes a machine learning method for hyperparameter optimization that shows considerable potential to predict the spatial heterogeneity of soil water content, which can effectively support regional farmland soil and water conservation and high-quality agricultural development.https://www.mdpi.com/2077-0472/13/5/1088machine learningGBDThyperparametersoil water contentSentinel-2farmland
spellingShingle Dexi Zhan
Yongqi Mu
Wenxu Duan
Mingzhu Ye
Yingqiang Song
Zhenqi Song
Kaizhong Yao
Dengkuo Sun
Ziqi Ding
Spatial Prediction and Mapping of Soil Water Content by TPE-GBDT Model in Chinese Coastal Delta Farmland with Sentinel-2 Remote Sensing Data
Agriculture
machine learning
GBDT
hyperparameter
soil water content
Sentinel-2
farmland
title Spatial Prediction and Mapping of Soil Water Content by TPE-GBDT Model in Chinese Coastal Delta Farmland with Sentinel-2 Remote Sensing Data
title_full Spatial Prediction and Mapping of Soil Water Content by TPE-GBDT Model in Chinese Coastal Delta Farmland with Sentinel-2 Remote Sensing Data
title_fullStr Spatial Prediction and Mapping of Soil Water Content by TPE-GBDT Model in Chinese Coastal Delta Farmland with Sentinel-2 Remote Sensing Data
title_full_unstemmed Spatial Prediction and Mapping of Soil Water Content by TPE-GBDT Model in Chinese Coastal Delta Farmland with Sentinel-2 Remote Sensing Data
title_short Spatial Prediction and Mapping of Soil Water Content by TPE-GBDT Model in Chinese Coastal Delta Farmland with Sentinel-2 Remote Sensing Data
title_sort spatial prediction and mapping of soil water content by tpe gbdt model in chinese coastal delta farmland with sentinel 2 remote sensing data
topic machine learning
GBDT
hyperparameter
soil water content
Sentinel-2
farmland
url https://www.mdpi.com/2077-0472/13/5/1088
work_keys_str_mv AT dexizhan spatialpredictionandmappingofsoilwatercontentbytpegbdtmodelinchinesecoastaldeltafarmlandwithsentinel2remotesensingdata
AT yongqimu spatialpredictionandmappingofsoilwatercontentbytpegbdtmodelinchinesecoastaldeltafarmlandwithsentinel2remotesensingdata
AT wenxuduan spatialpredictionandmappingofsoilwatercontentbytpegbdtmodelinchinesecoastaldeltafarmlandwithsentinel2remotesensingdata
AT mingzhuye spatialpredictionandmappingofsoilwatercontentbytpegbdtmodelinchinesecoastaldeltafarmlandwithsentinel2remotesensingdata
AT yingqiangsong spatialpredictionandmappingofsoilwatercontentbytpegbdtmodelinchinesecoastaldeltafarmlandwithsentinel2remotesensingdata
AT zhenqisong spatialpredictionandmappingofsoilwatercontentbytpegbdtmodelinchinesecoastaldeltafarmlandwithsentinel2remotesensingdata
AT kaizhongyao spatialpredictionandmappingofsoilwatercontentbytpegbdtmodelinchinesecoastaldeltafarmlandwithsentinel2remotesensingdata
AT dengkuosun spatialpredictionandmappingofsoilwatercontentbytpegbdtmodelinchinesecoastaldeltafarmlandwithsentinel2remotesensingdata
AT ziqiding spatialpredictionandmappingofsoilwatercontentbytpegbdtmodelinchinesecoastaldeltafarmlandwithsentinel2remotesensingdata