Tree-Based Modeling Methods to Predict Nitrate Exceedances in the Ogallala Aquifer in Texas

The performance of four tree-based classification techniques—classification and regression trees (CART), multi-adaptive regression splines (MARS), random forests (RF) and gradient boosting trees (GBT) were compared against the commonly used logistic regression (LR) analysis to assess aquifer vulnera...

Full description

Bibliographic Details
Main Authors: Venkatesh Uddameri, Ana Luiza Bessa Silva, Sreeram Singaraju, Ghazal Mohammadi, E. Annette Hernandez
Format: Article
Language:English
Published: MDPI AG 2020-04-01
Series:Water
Subjects:
Online Access:https://www.mdpi.com/2073-4441/12/4/1023
_version_ 1797571526370787328
author Venkatesh Uddameri
Ana Luiza Bessa Silva
Sreeram Singaraju
Ghazal Mohammadi
E. Annette Hernandez
author_facet Venkatesh Uddameri
Ana Luiza Bessa Silva
Sreeram Singaraju
Ghazal Mohammadi
E. Annette Hernandez
author_sort Venkatesh Uddameri
collection DOAJ
description The performance of four tree-based classification techniques—classification and regression trees (CART), multi-adaptive regression splines (MARS), random forests (RF) and gradient boosting trees (GBT) were compared against the commonly used logistic regression (LR) analysis to assess aquifer vulnerability in the Ogallala Aquifer of Texas. The results indicate that the tree-based models performed better than the logistic regression model, as they were able to locally refine nitrate exceedance probabilities. RF exhibited the best generalizable capabilities. The CART model did better in predicting non-exceedances. Nitrate exceedances were sensitive to well depths—an indicator of aquifer redox conditions, which, in turn, was controlled by alkalinity increases brought forth by the dissolution of calcium carbonate. The clay content of soils and soil organic matter, which serve as indicators of agriculture activities, were also noted to have significant influences on nitrate exceedances. Likely nitrogen releases from confined animal feedlot operations in the northeast portions of the study area also appeared to be locally important. Integrated soil, hydrogeological and geochemical datasets, in conjunction with tree-based methods, help elucidate processes controlling nitrate exceedances. Overall, tree-based models offer flexible, transparent approaches for mapping nitrate exceedances, identifying underlying mechanisms and prioritizing monitoring activities.
first_indexed 2024-03-10T20:42:21Z
format Article
id doaj.art-c543c45f9a2c41fe81003096b5e8e824
institution Directory Open Access Journal
issn 2073-4441
language English
last_indexed 2024-03-10T20:42:21Z
publishDate 2020-04-01
publisher MDPI AG
record_format Article
series Water
spelling doaj.art-c543c45f9a2c41fe81003096b5e8e8242023-11-19T20:37:36ZengMDPI AGWater2073-44412020-04-01124102310.3390/w12041023Tree-Based Modeling Methods to Predict Nitrate Exceedances in the Ogallala Aquifer in TexasVenkatesh Uddameri0Ana Luiza Bessa Silva1Sreeram Singaraju2Ghazal Mohammadi3E. Annette Hernandez4Department of Civil, Environmental and Construction Engineering, Tech University, Lubbock, TX 79409-1023, USADepartment of Civil, Environmental and Construction Engineering, Tech University, Lubbock, TX 79409-1023, USADepartment of Civil, Environmental and Construction Engineering, Tech University, Lubbock, TX 79409-1023, USADepartment of Civil, Environmental and Construction Engineering, Tech University, Lubbock, TX 79409-1023, USADepartment of Civil, Environmental and Construction Engineering, Tech University, Lubbock, TX 79409-1023, USAThe performance of four tree-based classification techniques—classification and regression trees (CART), multi-adaptive regression splines (MARS), random forests (RF) and gradient boosting trees (GBT) were compared against the commonly used logistic regression (LR) analysis to assess aquifer vulnerability in the Ogallala Aquifer of Texas. The results indicate that the tree-based models performed better than the logistic regression model, as they were able to locally refine nitrate exceedance probabilities. RF exhibited the best generalizable capabilities. The CART model did better in predicting non-exceedances. Nitrate exceedances were sensitive to well depths—an indicator of aquifer redox conditions, which, in turn, was controlled by alkalinity increases brought forth by the dissolution of calcium carbonate. The clay content of soils and soil organic matter, which serve as indicators of agriculture activities, were also noted to have significant influences on nitrate exceedances. Likely nitrogen releases from confined animal feedlot operations in the northeast portions of the study area also appeared to be locally important. Integrated soil, hydrogeological and geochemical datasets, in conjunction with tree-based methods, help elucidate processes controlling nitrate exceedances. Overall, tree-based models offer flexible, transparent approaches for mapping nitrate exceedances, identifying underlying mechanisms and prioritizing monitoring activities.https://www.mdpi.com/2073-4441/12/4/1023aquifer vulnerabilitymachine learningrandom forestsCARTMARSgradient boosting algorithms
spellingShingle Venkatesh Uddameri
Ana Luiza Bessa Silva
Sreeram Singaraju
Ghazal Mohammadi
E. Annette Hernandez
Tree-Based Modeling Methods to Predict Nitrate Exceedances in the Ogallala Aquifer in Texas
Water
aquifer vulnerability
machine learning
random forests
CART
MARS
gradient boosting algorithms
title Tree-Based Modeling Methods to Predict Nitrate Exceedances in the Ogallala Aquifer in Texas
title_full Tree-Based Modeling Methods to Predict Nitrate Exceedances in the Ogallala Aquifer in Texas
title_fullStr Tree-Based Modeling Methods to Predict Nitrate Exceedances in the Ogallala Aquifer in Texas
title_full_unstemmed Tree-Based Modeling Methods to Predict Nitrate Exceedances in the Ogallala Aquifer in Texas
title_short Tree-Based Modeling Methods to Predict Nitrate Exceedances in the Ogallala Aquifer in Texas
title_sort tree based modeling methods to predict nitrate exceedances in the ogallala aquifer in texas
topic aquifer vulnerability
machine learning
random forests
CART
MARS
gradient boosting algorithms
url https://www.mdpi.com/2073-4441/12/4/1023
work_keys_str_mv AT venkateshuddameri treebasedmodelingmethodstopredictnitrateexceedancesintheogallalaaquiferintexas
AT analuizabessasilva treebasedmodelingmethodstopredictnitrateexceedancesintheogallalaaquiferintexas
AT sreeramsingaraju treebasedmodelingmethodstopredictnitrateexceedancesintheogallalaaquiferintexas
AT ghazalmohammadi treebasedmodelingmethodstopredictnitrateexceedancesintheogallalaaquiferintexas
AT eannettehernandez treebasedmodelingmethodstopredictnitrateexceedancesintheogallalaaquiferintexas