Soil Moisture, Organic Carbon, and Nitrogen Content Prediction with Hyperspectral Data Using Regression Models

Soil moisture, soil organic carbon, and nitrogen content prediction are considered significant fields of study as they are directly related to plant health and food production. Direct estimation of these soil properties with traditional methods, for example, the oven-drying technique and chemical an...

Full description

Bibliographic Details
Main Authors: Dristi Datta, Manoranjan Paul, Manzur Murshed, Shyh Wei Teng, Leigh Schmidtke
Format: Article
Language:English
Published: MDPI AG 2022-10-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/22/20/7998
_version_ 1797469907084902400
author Dristi Datta
Manoranjan Paul
Manzur Murshed
Shyh Wei Teng
Leigh Schmidtke
author_facet Dristi Datta
Manoranjan Paul
Manzur Murshed
Shyh Wei Teng
Leigh Schmidtke
author_sort Dristi Datta
collection DOAJ
description Soil moisture, soil organic carbon, and nitrogen content prediction are considered significant fields of study as they are directly related to plant health and food production. Direct estimation of these soil properties with traditional methods, for example, the oven-drying technique and chemical analysis, is a time and resource-consuming approach and can predict only smaller areas. With the significant development of remote sensing and hyperspectral (HS) imaging technologies, soil moisture, carbon, and nitrogen can be estimated over vast areas. This paper presents a generalized approach to predicting three different essential soil contents using a comprehensive study of various machine learning (ML) models by considering the dimensional reduction in feature spaces. In this study, we have used three popular benchmark HS datasets captured in Germany and Sweden. The efficacy of different ML algorithms is evaluated to predict soil content, and significant improvement is obtained when a specific range of bands is selected. The performance of ML models is further improved by applying principal component analysis (PCA), a dimensional reduction method that works with an unsupervised learning method. The effect of soil temperature on soil moisture prediction is evaluated in this study, and the results show that when the soil temperature is considered with the HS band, the soil moisture prediction accuracy does not improve. However, the combined effect of band selection and feature transformation using PCA significantly enhances the prediction accuracy for soil moisture, carbon, and nitrogen content. This study represents a comprehensive analysis of a wide range of established ML regression models using data preprocessing, effective band selection, and data dimension reduction and attempt to understand which feature combinations provide the best accuracy. The outcomes of several ML models are verified with validation techniques and the best- and worst-case scenarios in terms of soil content are noted. The proposed approach outperforms existing estimation techniques.
first_indexed 2024-03-09T19:29:35Z
format Article
id doaj.art-403459cc268149a3bb6f66a0a9c090d7
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-09T19:29:35Z
publishDate 2022-10-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-403459cc268149a3bb6f66a0a9c090d72023-11-24T02:30:04ZengMDPI AGSensors1424-82202022-10-012220799810.3390/s22207998Soil Moisture, Organic Carbon, and Nitrogen Content Prediction with Hyperspectral Data Using Regression ModelsDristi Datta0Manoranjan Paul1Manzur Murshed2Shyh Wei Teng3Leigh Schmidtke4School of Computing, Mathematics, and Engineering, Charles Sturt University, Bathurst, NSW 2795, AustraliaSchool of Computing, Mathematics, and Engineering, Charles Sturt University, Bathurst, NSW 2795, AustraliaCentre for Smart Analytics, Federation University Australia, Berwick, VIC 3806, AustraliaInstitute of Innovation, Science and Sustainability, Federation University Australia, Berwick, VIC 3806, AustraliaGulbali Institue, Charles Sturt University, Wagga Wagga, NSW 2650, AustraliaSoil moisture, soil organic carbon, and nitrogen content prediction are considered significant fields of study as they are directly related to plant health and food production. Direct estimation of these soil properties with traditional methods, for example, the oven-drying technique and chemical analysis, is a time and resource-consuming approach and can predict only smaller areas. With the significant development of remote sensing and hyperspectral (HS) imaging technologies, soil moisture, carbon, and nitrogen can be estimated over vast areas. This paper presents a generalized approach to predicting three different essential soil contents using a comprehensive study of various machine learning (ML) models by considering the dimensional reduction in feature spaces. In this study, we have used three popular benchmark HS datasets captured in Germany and Sweden. The efficacy of different ML algorithms is evaluated to predict soil content, and significant improvement is obtained when a specific range of bands is selected. The performance of ML models is further improved by applying principal component analysis (PCA), a dimensional reduction method that works with an unsupervised learning method. The effect of soil temperature on soil moisture prediction is evaluated in this study, and the results show that when the soil temperature is considered with the HS band, the soil moisture prediction accuracy does not improve. However, the combined effect of band selection and feature transformation using PCA significantly enhances the prediction accuracy for soil moisture, carbon, and nitrogen content. This study represents a comprehensive analysis of a wide range of established ML regression models using data preprocessing, effective band selection, and data dimension reduction and attempt to understand which feature combinations provide the best accuracy. The outcomes of several ML models are verified with validation techniques and the best- and worst-case scenarios in terms of soil content are noted. The proposed approach outperforms existing estimation techniques.https://www.mdpi.com/1424-8220/22/20/7998LUCAS databand selectionmachine learningprincipal component analysisk-fold cross validation
spellingShingle Dristi Datta
Manoranjan Paul
Manzur Murshed
Shyh Wei Teng
Leigh Schmidtke
Soil Moisture, Organic Carbon, and Nitrogen Content Prediction with Hyperspectral Data Using Regression Models
Sensors
LUCAS data
band selection
machine learning
principal component analysis
k-fold cross validation
title Soil Moisture, Organic Carbon, and Nitrogen Content Prediction with Hyperspectral Data Using Regression Models
title_full Soil Moisture, Organic Carbon, and Nitrogen Content Prediction with Hyperspectral Data Using Regression Models
title_fullStr Soil Moisture, Organic Carbon, and Nitrogen Content Prediction with Hyperspectral Data Using Regression Models
title_full_unstemmed Soil Moisture, Organic Carbon, and Nitrogen Content Prediction with Hyperspectral Data Using Regression Models
title_short Soil Moisture, Organic Carbon, and Nitrogen Content Prediction with Hyperspectral Data Using Regression Models
title_sort soil moisture organic carbon and nitrogen content prediction with hyperspectral data using regression models
topic LUCAS data
band selection
machine learning
principal component analysis
k-fold cross validation
url https://www.mdpi.com/1424-8220/22/20/7998
work_keys_str_mv AT dristidatta soilmoistureorganiccarbonandnitrogencontentpredictionwithhyperspectraldatausingregressionmodels
AT manoranjanpaul soilmoistureorganiccarbonandnitrogencontentpredictionwithhyperspectraldatausingregressionmodels
AT manzurmurshed soilmoistureorganiccarbonandnitrogencontentpredictionwithhyperspectraldatausingregressionmodels
AT shyhweiteng soilmoistureorganiccarbonandnitrogencontentpredictionwithhyperspectraldatausingregressionmodels
AT leighschmidtke soilmoistureorganiccarbonandnitrogencontentpredictionwithhyperspectraldatausingregressionmodels