Integration of multimodal data for large-scale rapid agricultural land evaluation using machine learning and deep learning approaches

Bibliographic Details
Main Authors: Liangdan Li, Luo Liu, Yiping Peng, Yingyue Su, Yueming Hu, Runyan Zou
Format: Article
Language: English
Published: Elsevier 2023-11-01
Series: Geoderma
Online Access: http://www.sciencedirect.com/science/article/pii/S0016706123003737
Description
Summary: Rapid and accurate agricultural land evaluation provides essential guidance for the supervision and allocation of agricultural land resources; it also helps to ensure food security. Previous work has mainly evaluated land quality at the county level using field sampling data and a factor-based approach. However, uniform, large-scale agricultural land evaluation is difficult to achieve with conventional approaches because of the spatial heterogeneity of agricultural land and the high time and economic costs of data acquisition. In this study, we integrated publicly available multimodal data (i.e., satellite remote sensing, environmental, and socioeconomic data) on the Google Earth Engine (GEE) platform, selected the best indicators from each modality using the geodetector, and designed different combinations of model inputs on that basis. We then developed machine learning (random forest, RF) and deep learning (deep neural network, DNN) models to evaluate land quality in paddy field and dry land systems throughout Guangdong Province, China, in 2013. The results showed that model performance decreased across input combinations in the following order: multimodal > bimodal > unimodal. With the best input combination, the RF model (R² = 0.91, RMSE = 97.56, and CCC = 0.95) outperformed the DNN model (R² = 0.89, RMSE = 108.72, and CCC = 0.94) in predicting the quality of paddy fields. The RF model (R² = 0.90, RMSE = 104.27, and CCC = 0.95) also outperformed the DNN model (R² = 0.86, RMSE = 124.38, and CCC = 0.93) in predicting the quality of dry land. The land quality estimates from both the RF and DNN models were more accurate for paddy fields than for dry land because land quality is more homogeneous in paddy fields. This research proposed a simple, low-cost approach for rapid and accurate agricultural land evaluation at the provincial scale using publicly available multimodal data, which can help to achieve control of the agricultural land grade at multiple spatial and temporal scales.
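The summary reports R², RMSE, and CCC for each model and names the geodetector as the indicator-selection tool. The following Python sketch shows how such quantities are commonly computed; it is illustrative only and is not taken from the article. It assumes R² is the coefficient of determination (1 - SS_res / SS_tot), CCC is Lin's concordance correlation coefficient, and the geodetector score is the factor-detector q-statistic computed on an indicator that has already been discretized into strata. The function names `r2_rmse_ccc` and `geodetector_q` are hypothetical, and the article's exact definitions and preprocessing may differ.

```python
import math


def r2_rmse_ccc(observed, predicted):
    """Return (R2, RMSE, CCC) for paired observed and predicted values.

    Assumed definitions (not confirmed by the source record):
    R2 = 1 - SS_res / SS_tot, CCC = Lin's concordance correlation coefficient.
    """
    n = len(observed)
    if n != len(predicted) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    # Residual and total sums of squares.
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_o) ** 2 for o in observed)
    # Population variances and covariance, as used in Lin's CCC.
    var_o = ss_tot / n
    var_p = sum((p - mean_p) ** 2 for p in predicted) / n
    cov = sum((o - mean_o) * (p - mean_p)
              for o, p in zip(observed, predicted)) / n
    r2 = 1.0 - ss_res / ss_tot
    rmse = math.sqrt(ss_res / n)
    ccc = 2.0 * cov / (var_o + var_p + (mean_o - mean_p) ** 2)
    return r2, rmse, ccc


def geodetector_q(values, strata):
    """Factor-detector q-statistic: 1 - (within-strata SS) / (total SS).

    `values` holds the target (e.g. a land quality score) and `strata` the
    class label of one candidate indicator for the same samples.
    """
    if len(values) != len(strata) or not values:
        raise ValueError("values and strata must be non-empty and equal length")
    mean_all = sum(values) / len(values)
    ss_total = sum((v - mean_all) ** 2 for v in values)
    # Group the target values by stratum label.
    groups = {}
    for v, s in zip(values, strata):
        groups.setdefault(s, []).append(v)
    ss_within = 0.0
    for group in groups.values():
        group_mean = sum(group) / len(group)
        ss_within += sum((v - group_mean) ** 2 for v in group)
    return 1.0 - ss_within / ss_total


if __name__ == "__main__":
    # Toy numbers for illustration only; they are not data from the study.
    obs = [1.0, 2.0, 3.0, 4.0]
    pred = [2.0, 3.0, 4.0, 5.0]
    print(r2_rmse_ccc(obs, pred))
    print(geodetector_q([1.0, 1.0, 5.0, 5.0], ["a", "a", "b", "b"]))
```

Under these assumptions, a q value near 1 means the indicator's strata explain most of the spatial variance in the target, which is the usual reason for ranking indicators by q before model fitting. CCC penalizes systematic offsets that a correlation-based score would ignore, which is why it is often reported alongside R² and RMSE.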
ISSN: 1872-6259