Reconstruction of Near‐Surface Hourly Wind Speed Data Sets During the Period 1959–2021 Over China

Abstract The reconstruction models of hourly 10‐m wind speeds were developed for each of 2384 stations over China using stepwise regression, random forest and XGBoost machine learning approaches based on hourly observed and ERA5 reanalysis data from 2005 to 2021. The reconstruction procedures applie...

Full description

Bibliographic Details
Main Authors: Chad Shouquan Cheng, Shufeng Yang, Jian Zhu, Tao Yang, Gaozhen Nie, Jian Tang, Yanyan Lu
Format: Article
Language:English
Published: American Geophysical Union (AGU) 2023-11-01
Series:Earth and Space Science
Subjects:
Online Access:https://doi.org/10.1029/2023EA003295
_version_ 1797448959546884096
author Chad Shouquan Cheng
Shufeng Yang
Jian Zhu
Tao Yang
Gaozhen Nie
Jian Tang
Yanyan Lu
author_facet Chad Shouquan Cheng
Shufeng Yang
Jian Zhu
Tao Yang
Gaozhen Nie
Jian Tang
Yanyan Lu
author_sort Chad Shouquan Cheng
collection DOAJ
description Abstract The reconstruction models of hourly 10‐m wind speeds were developed for each of 2384 stations over China using stepwise regression, random forest and XGBoost machine learning approaches based on hourly observed and ERA5 reanalysis data from 2005 to 2021. The reconstruction procedures applied observed hourly data to reduce the systematic biases of reanalysis data sets. Furthermore, the procedures employed the past dynamically consistent states of the atmosphere simulated by ERA5 reanalysis techniques to reduce/remove wind‐observed data impacts of long‐term non‐meteorological condition changes over time (e.g., urbanization around weather stations), which provide homogenous hourly wind speed data sets from 1959 to 2021. The systematic errors of the models' simulations derived from three approaches are similarly small, with almost two orders of magnitude smaller than ERA5 original data sets. The systematic errors of reconstructed data sets derived from stepwise regression are similar to its simulation; however, the biases from two machine learning methods are even much greater than ERA5 original data sets. This result implies that machine learning methods are not suitable for such typical time‐series predictions using the previous‐hour wind speed as a predictor to reconstruct wind speed data for the next hour. Therefore, stepwise regression was selected to reconstruct hourly wind speed data sets, which have much better quality than ERA5 reanalysis data with the median correctness increased by >50% and the median rRMSE decreased by 25%–50%. Consequently, the reconstructed wind speed data sets have great potential to be useful for more precisely assessing the characteristics/trends of wind energy resources in the past 60 years over China.
first_indexed 2024-03-09T14:18:11Z
format Article
id doaj.art-e58a9042eedb4efaa1f232cba61c381b
institution Directory Open Access Journal
issn 2333-5084
language English
last_indexed 2024-03-09T14:18:11Z
publishDate 2023-11-01
publisher American Geophysical Union (AGU)
record_format Article
series Earth and Space Science
spelling doaj.art-e58a9042eedb4efaa1f232cba61c381b2023-11-28T20:18:31ZengAmerican Geophysical Union (AGU)Earth and Space Science2333-50842023-11-011011n/an/a10.1029/2023EA003295Reconstruction of Near‐Surface Hourly Wind Speed Data Sets During the Period 1959–2021 Over ChinaChad Shouquan Cheng0Shufeng Yang1Jian Zhu2Tao Yang3Gaozhen Nie4Jian Tang5Yanyan Lu6State Key Laboratory of Hydrology‐Water Resources and Hydraulic Engineering Center for Global Change and Water Cycle Hohai University Nanjing ChinaCollege of Hydrology and Water Resources Hohai University Nanjing ChinaState Key Laboratory of Hydrology‐Water Resources and Hydraulic Engineering Center for Global Change and Water Cycle Hohai University Nanjing ChinaState Key Laboratory of Hydrology‐Water Resources and Hydraulic Engineering Center for Global Change and Water Cycle Hohai University Nanjing ChinaNational Meteorological Center China Meteorological Administration Beijing ChinaNational Meteorological Center China Meteorological Administration Beijing ChinaKey Laboratory of Far‐shore Wind Power Technology of Zhejiang Province Hangzhou ChinaAbstract The reconstruction models of hourly 10‐m wind speeds were developed for each of 2384 stations over China using stepwise regression, random forest and XGBoost machine learning approaches based on hourly observed and ERA5 reanalysis data from 2005 to 2021. The reconstruction procedures applied observed hourly data to reduce the systematic biases of reanalysis data sets. Furthermore, the procedures employed the past dynamically consistent states of the atmosphere simulated by ERA5 reanalysis techniques to reduce/remove wind‐observed data impacts of long‐term non‐meteorological condition changes over time (e.g., urbanization around weather stations), which provide homogenous hourly wind speed data sets from 1959 to 2021. The systematic errors of the models' simulations derived from three approaches are similarly small, with almost two orders of magnitude smaller than ERA5 original data sets. The systematic errors of reconstructed data sets derived from stepwise regression are similar to its simulation; however, the biases from two machine learning methods are even much greater than ERA5 original data sets. This result implies that machine learning methods are not suitable for such typical time‐series predictions using the previous‐hour wind speed as a predictor to reconstruct wind speed data for the next hour. Therefore, stepwise regression was selected to reconstruct hourly wind speed data sets, which have much better quality than ERA5 reanalysis data with the median correctness increased by >50% and the median rRMSE decreased by 25%–50%. Consequently, the reconstructed wind speed data sets have great potential to be useful for more precisely assessing the characteristics/trends of wind energy resources in the past 60 years over China.https://doi.org/10.1029/2023EA003295hourly wind speed reconstructionmachine learningstepwise regressionhomogenous wind speed dataChina
spellingShingle Chad Shouquan Cheng
Shufeng Yang
Jian Zhu
Tao Yang
Gaozhen Nie
Jian Tang
Yanyan Lu
Reconstruction of Near‐Surface Hourly Wind Speed Data Sets During the Period 1959–2021 Over China
Earth and Space Science
hourly wind speed reconstruction
machine learning
stepwise regression
homogenous wind speed data
China
title Reconstruction of Near‐Surface Hourly Wind Speed Data Sets During the Period 1959–2021 Over China
title_full Reconstruction of Near‐Surface Hourly Wind Speed Data Sets During the Period 1959–2021 Over China
title_fullStr Reconstruction of Near‐Surface Hourly Wind Speed Data Sets During the Period 1959–2021 Over China
title_full_unstemmed Reconstruction of Near‐Surface Hourly Wind Speed Data Sets During the Period 1959–2021 Over China
title_short Reconstruction of Near‐Surface Hourly Wind Speed Data Sets During the Period 1959–2021 Over China
title_sort reconstruction of near surface hourly wind speed data sets during the period 1959 2021 over china
topic hourly wind speed reconstruction
machine learning
stepwise regression
homogenous wind speed data
China
url https://doi.org/10.1029/2023EA003295
work_keys_str_mv AT chadshouquancheng reconstructionofnearsurfacehourlywindspeeddatasetsduringtheperiod19592021overchina
AT shufengyang reconstructionofnearsurfacehourlywindspeeddatasetsduringtheperiod19592021overchina
AT jianzhu reconstructionofnearsurfacehourlywindspeeddatasetsduringtheperiod19592021overchina
AT taoyang reconstructionofnearsurfacehourlywindspeeddatasetsduringtheperiod19592021overchina
AT gaozhennie reconstructionofnearsurfacehourlywindspeeddatasetsduringtheperiod19592021overchina
AT jiantang reconstructionofnearsurfacehourlywindspeeddatasetsduringtheperiod19592021overchina
AT yanyanlu reconstructionofnearsurfacehourlywindspeeddatasetsduringtheperiod19592021overchina