Tree-Based Modeling Methods to Predict Nitrate Exceedances in the Ogallala Aquifer in Texas
The performance of four tree-based classification techniques—classification and regression trees (CART), multi-adaptive regression splines (MARS), random forests (RF) and gradient boosting trees (GBT) were compared against the commonly used logistic regression (LR) analysis to assess aquifer vulnera...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-04-01
|
Series: | Water |
Subjects: | |
Online Access: | https://www.mdpi.com/2073-4441/12/4/1023 |
_version_ | 1797571526370787328 |
---|---|
author | Venkatesh Uddameri Ana Luiza Bessa Silva Sreeram Singaraju Ghazal Mohammadi E. Annette Hernandez |
author_facet | Venkatesh Uddameri Ana Luiza Bessa Silva Sreeram Singaraju Ghazal Mohammadi E. Annette Hernandez |
author_sort | Venkatesh Uddameri |
collection | DOAJ |
description | The performance of four tree-based classification techniques—classification and regression trees (CART), multi-adaptive regression splines (MARS), random forests (RF) and gradient boosting trees (GBT) were compared against the commonly used logistic regression (LR) analysis to assess aquifer vulnerability in the Ogallala Aquifer of Texas. The results indicate that the tree-based models performed better than the logistic regression model, as they were able to locally refine nitrate exceedance probabilities. RF exhibited the best generalizable capabilities. The CART model did better in predicting non-exceedances. Nitrate exceedances were sensitive to well depths—an indicator of aquifer redox conditions, which, in turn, was controlled by alkalinity increases brought forth by the dissolution of calcium carbonate. The clay content of soils and soil organic matter, which serve as indicators of agriculture activities, were also noted to have significant influences on nitrate exceedances. Likely nitrogen releases from confined animal feedlot operations in the northeast portions of the study area also appeared to be locally important. Integrated soil, hydrogeological and geochemical datasets, in conjunction with tree-based methods, help elucidate processes controlling nitrate exceedances. Overall, tree-based models offer flexible, transparent approaches for mapping nitrate exceedances, identifying underlying mechanisms and prioritizing monitoring activities. |
first_indexed | 2024-03-10T20:42:21Z |
format | Article |
id | doaj.art-c543c45f9a2c41fe81003096b5e8e824 |
institution | Directory Open Access Journal |
issn | 2073-4441 |
language | English |
last_indexed | 2024-03-10T20:42:21Z |
publishDate | 2020-04-01 |
publisher | MDPI AG |
record_format | Article |
series | Water |
spelling | doaj.art-c543c45f9a2c41fe81003096b5e8e8242023-11-19T20:37:36ZengMDPI AGWater2073-44412020-04-01124102310.3390/w12041023Tree-Based Modeling Methods to Predict Nitrate Exceedances in the Ogallala Aquifer in TexasVenkatesh Uddameri0Ana Luiza Bessa Silva1Sreeram Singaraju2Ghazal Mohammadi3E. Annette Hernandez4Department of Civil, Environmental and Construction Engineering, Tech University, Lubbock, TX 79409-1023, USADepartment of Civil, Environmental and Construction Engineering, Tech University, Lubbock, TX 79409-1023, USADepartment of Civil, Environmental and Construction Engineering, Tech University, Lubbock, TX 79409-1023, USADepartment of Civil, Environmental and Construction Engineering, Tech University, Lubbock, TX 79409-1023, USADepartment of Civil, Environmental and Construction Engineering, Tech University, Lubbock, TX 79409-1023, USAThe performance of four tree-based classification techniques—classification and regression trees (CART), multi-adaptive regression splines (MARS), random forests (RF) and gradient boosting trees (GBT) were compared against the commonly used logistic regression (LR) analysis to assess aquifer vulnerability in the Ogallala Aquifer of Texas. The results indicate that the tree-based models performed better than the logistic regression model, as they were able to locally refine nitrate exceedance probabilities. RF exhibited the best generalizable capabilities. The CART model did better in predicting non-exceedances. Nitrate exceedances were sensitive to well depths—an indicator of aquifer redox conditions, which, in turn, was controlled by alkalinity increases brought forth by the dissolution of calcium carbonate. The clay content of soils and soil organic matter, which serve as indicators of agriculture activities, were also noted to have significant influences on nitrate exceedances. Likely nitrogen releases from confined animal feedlot operations in the northeast portions of the study area also appeared to be locally important. Integrated soil, hydrogeological and geochemical datasets, in conjunction with tree-based methods, help elucidate processes controlling nitrate exceedances. Overall, tree-based models offer flexible, transparent approaches for mapping nitrate exceedances, identifying underlying mechanisms and prioritizing monitoring activities.https://www.mdpi.com/2073-4441/12/4/1023aquifer vulnerabilitymachine learningrandom forestsCARTMARSgradient boosting algorithms |
spellingShingle | Venkatesh Uddameri Ana Luiza Bessa Silva Sreeram Singaraju Ghazal Mohammadi E. Annette Hernandez Tree-Based Modeling Methods to Predict Nitrate Exceedances in the Ogallala Aquifer in Texas Water aquifer vulnerability machine learning random forests CART MARS gradient boosting algorithms |
title | Tree-Based Modeling Methods to Predict Nitrate Exceedances in the Ogallala Aquifer in Texas |
title_full | Tree-Based Modeling Methods to Predict Nitrate Exceedances in the Ogallala Aquifer in Texas |
title_fullStr | Tree-Based Modeling Methods to Predict Nitrate Exceedances in the Ogallala Aquifer in Texas |
title_full_unstemmed | Tree-Based Modeling Methods to Predict Nitrate Exceedances in the Ogallala Aquifer in Texas |
title_short | Tree-Based Modeling Methods to Predict Nitrate Exceedances in the Ogallala Aquifer in Texas |
title_sort | tree based modeling methods to predict nitrate exceedances in the ogallala aquifer in texas |
topic | aquifer vulnerability machine learning random forests CART MARS gradient boosting algorithms |
url | https://www.mdpi.com/2073-4441/12/4/1023 |
work_keys_str_mv | AT venkateshuddameri treebasedmodelingmethodstopredictnitrateexceedancesintheogallalaaquiferintexas AT analuizabessasilva treebasedmodelingmethodstopredictnitrateexceedancesintheogallalaaquiferintexas AT sreeramsingaraju treebasedmodelingmethodstopredictnitrateexceedancesintheogallalaaquiferintexas AT ghazalmohammadi treebasedmodelingmethodstopredictnitrateexceedancesintheogallalaaquiferintexas AT eannettehernandez treebasedmodelingmethodstopredictnitrateexceedancesintheogallalaaquiferintexas |