Automated Analysis of the US Drought Monitor Maps With Machine Learning and Multiple Drought Indicators

The US Drought Monitor (USDM) is a hallmark in real time drought monitoring and assessment as it was developed by multiple agencies to provide an accurate and timely assessment of drought conditions in the US on a weekly basis. The map is built based on multiple physical indicators as well as report...

Full description

Bibliographic Details
Main Authors: Pouyan Hatami Bahman Beiglou, Lifeng Luo, Pang-Ning Tan, Lisi Pei
Format: Article
Language:English
Published: Frontiers Media S.A. 2021-10-01
Series:Frontiers in Big Data
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fdata.2021.750536/full
_version_ 1819259935765037056
author Pouyan Hatami Bahman Beiglou
Lifeng Luo
Pang-Ning Tan
Lisi Pei
author_facet Pouyan Hatami Bahman Beiglou
Lifeng Luo
Pang-Ning Tan
Lisi Pei
author_sort Pouyan Hatami Bahman Beiglou
collection DOAJ
description The US Drought Monitor (USDM) is a hallmark in real time drought monitoring and assessment as it was developed by multiple agencies to provide an accurate and timely assessment of drought conditions in the US on a weekly basis. The map is built based on multiple physical indicators as well as reported observations from local contributors before human analysts combine the information and produce the drought map using their best judgement. Since human subjectivity is included in the production of the USDM maps, it is not an entirely clear quantitative procedure for other entities to reproduce the maps. In this study, we developed a framework to automatically generate the maps through a machine learning approach by predicting the drought categories across the domain of study. A persistence model served as the baseline model for comparison in the framework. Three machine learning algorithms, logistic regression, random forests, and support vector machines, with four different groups of input data, which formed an overall of 12 different configurations, were used for the prediction of drought categories. Finally, all the configurations were evaluated against the baseline model to select the best performing option. The results showed that our proposed framework could reproduce the drought maps to a near-perfect level with the support vector machines algorithm and the group 4 data. The rest of the findings of this study can be highlighted as: 1) employing the past week drought data as a predictor in the models played an important role in achieving high prediction scores, 2) the nonlinear models, random forest, and support vector machines had a better overall performance compared to the logistic regression models, and 3) with borrowing the neighboring grid cells information, we could compensate the lack of training data in the grid cells with insufficient historical USDM data particularly for extreme and exceptional drought conditions.
first_indexed 2024-12-23T19:17:55Z
format Article
id doaj.art-b82df35832a94046887f98753b3552d3
institution Directory Open Access Journal
issn 2624-909X
language English
last_indexed 2024-12-23T19:17:55Z
publishDate 2021-10-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Big Data
spelling doaj.art-b82df35832a94046887f98753b3552d32022-12-21T17:34:15ZengFrontiers Media S.A.Frontiers in Big Data2624-909X2021-10-01410.3389/fdata.2021.750536750536Automated Analysis of the US Drought Monitor Maps With Machine Learning and Multiple Drought IndicatorsPouyan Hatami Bahman Beiglou0Lifeng Luo1Pang-Ning Tan2Lisi Pei3Department of Geography, Environment, and Spatial Sciences, College of Social Science, Michigan State University, East Lansing, MI, United StatesDepartment of Geography, Environment, and Spatial Sciences, College of Social Science, Michigan State University, East Lansing, MI, United StatesDepartment of Computer Science and Engineering, College of Engineering, Michigan State University, East Lansing, MI, United StatesDepartment of Geography, Environment, and Spatial Sciences, College of Social Science, Michigan State University, East Lansing, MI, United StatesThe US Drought Monitor (USDM) is a hallmark in real time drought monitoring and assessment as it was developed by multiple agencies to provide an accurate and timely assessment of drought conditions in the US on a weekly basis. The map is built based on multiple physical indicators as well as reported observations from local contributors before human analysts combine the information and produce the drought map using their best judgement. Since human subjectivity is included in the production of the USDM maps, it is not an entirely clear quantitative procedure for other entities to reproduce the maps. In this study, we developed a framework to automatically generate the maps through a machine learning approach by predicting the drought categories across the domain of study. A persistence model served as the baseline model for comparison in the framework. Three machine learning algorithms, logistic regression, random forests, and support vector machines, with four different groups of input data, which formed an overall of 12 different configurations, were used for the prediction of drought categories. Finally, all the configurations were evaluated against the baseline model to select the best performing option. The results showed that our proposed framework could reproduce the drought maps to a near-perfect level with the support vector machines algorithm and the group 4 data. The rest of the findings of this study can be highlighted as: 1) employing the past week drought data as a predictor in the models played an important role in achieving high prediction scores, 2) the nonlinear models, random forest, and support vector machines had a better overall performance compared to the logistic regression models, and 3) with borrowing the neighboring grid cells information, we could compensate the lack of training data in the grid cells with insufficient historical USDM data particularly for extreme and exceptional drought conditions.https://www.frontiersin.org/articles/10.3389/fdata.2021.750536/fullUSDMmachine learningdrought monitoringlogistic regressionrandom forestSVM–support vector machines
spellingShingle Pouyan Hatami Bahman Beiglou
Lifeng Luo
Pang-Ning Tan
Lisi Pei
Automated Analysis of the US Drought Monitor Maps With Machine Learning and Multiple Drought Indicators
Frontiers in Big Data
USDM
machine learning
drought monitoring
logistic regression
random forest
SVM–support vector machines
title Automated Analysis of the US Drought Monitor Maps With Machine Learning and Multiple Drought Indicators
title_full Automated Analysis of the US Drought Monitor Maps With Machine Learning and Multiple Drought Indicators
title_fullStr Automated Analysis of the US Drought Monitor Maps With Machine Learning and Multiple Drought Indicators
title_full_unstemmed Automated Analysis of the US Drought Monitor Maps With Machine Learning and Multiple Drought Indicators
title_short Automated Analysis of the US Drought Monitor Maps With Machine Learning and Multiple Drought Indicators
title_sort automated analysis of the us drought monitor maps with machine learning and multiple drought indicators
topic USDM
machine learning
drought monitoring
logistic regression
random forest
SVM–support vector machines
url https://www.frontiersin.org/articles/10.3389/fdata.2021.750536/full
work_keys_str_mv AT pouyanhatamibahmanbeiglou automatedanalysisoftheusdroughtmonitormapswithmachinelearningandmultipledroughtindicators
AT lifengluo automatedanalysisoftheusdroughtmonitormapswithmachinelearningandmultipledroughtindicators
AT pangningtan automatedanalysisoftheusdroughtmonitormapswithmachinelearningandmultipledroughtindicators
AT lisipei automatedanalysisoftheusdroughtmonitormapswithmachinelearningandmultipledroughtindicators