SwinVRNN: A Data‐Driven Ensemble Forecasting Model via Learned Distribution Perturbation

Abstract The data‐driven approaches for medium‐range weather forecasting are recently shown to be extraordinarily promising for ensemble forecasting due to their fast inference speed compared to the traditional numerical weather prediction models. However, their forecast accuracy can hardly match th...

Full description

Bibliographic Details
Main Authors: Yuan Hu, Lei Chen, Zhibin Wang, Hao Li
Format: Article
Language:English
Published: American Geophysical Union (AGU) 2023-02-01
Series:Journal of Advances in Modeling Earth Systems
Subjects:
Online Access:https://doi.org/10.1029/2022MS003211
_version_ 1827795937820409856
author Yuan Hu
Lei Chen
Zhibin Wang
Hao Li
author_facet Yuan Hu
Lei Chen
Zhibin Wang
Hao Li
author_sort Yuan Hu
collection DOAJ
description Abstract The data‐driven approaches for medium‐range weather forecasting are recently shown to be extraordinarily promising for ensemble forecasting due to their fast inference speed compared to the traditional numerical weather prediction models. However, their forecast accuracy can hardly match the state‐of‐the‐art operational ECMWF Integrated Forecasting System (IFS) model. Previous data‐driven approaches perform ensemble forecasting using some simple perturbation methods, like the initial condition perturbation and the Monte Carlo dropout. However, their ensemble performance is often limited arguably by the sub‐optimal ways of applying perturbation. We propose a Swin Transformer‐based Variational Recurrent Neural Network (SwinVRNN), which is a stochastic weather forecasting model combining a SwinRNN predictor with a perturbation module. SwinRNN is designed as a Swin Transformer‐based recurrent neural network, which predicts the future states deterministically. Furthermore, to model the stochasticity in the prediction, we design a perturbation module following the Variational Auto‐Encoder paradigm to learn the multivariate Gaussian distributions of a time‐variant stochastic latent variable from the data. Ensemble forecasting can be easily performed by perturbing the model features leveraging the noise sampled from the learned distribution. We also compare four categories of perturbation methods for ensemble forecasting, that is, fixed distribution perturbation, learned distribution perturbation, MC dropout, and multi model ensemble. Comparisons on the WeatherBench data set show that the learned distribution perturbation method using our SwinVRNN model achieves remarkably improved forecasting accuracy and reasonable ensemble spread due to the joint optimization of the two targets. More notably, SwinVRNN surpasses operational IFS on the surface variables of the 2‐m temperature and the 6‐hourly total precipitation at all lead times up to 5 days (Code is available at https://github.com/tpys/wwprediction).
first_indexed 2024-03-11T18:58:24Z
format Article
id doaj.art-622301c50d9949089056ec60cc72a4f4
institution Directory Open Access Journal
issn 1942-2466
language English
last_indexed 2024-03-11T18:58:24Z
publishDate 2023-02-01
publisher American Geophysical Union (AGU)
record_format Article
series Journal of Advances in Modeling Earth Systems
spelling doaj.art-622301c50d9949089056ec60cc72a4f42023-10-10T14:00:43ZengAmerican Geophysical Union (AGU)Journal of Advances in Modeling Earth Systems1942-24662023-02-01152n/an/a10.1029/2022MS003211SwinVRNN: A Data‐Driven Ensemble Forecasting Model via Learned Distribution PerturbationYuan Hu0Lei Chen1Zhibin Wang2Hao Li3DAMO Academy Alibaba Group Beijing ChinaDAMO Academy Alibaba Group Beijing ChinaDAMO Academy Alibaba Group Beijing ChinaArtificial Intelligence Innovation and Incubation Institute Fudan University Shanghai ChinaAbstract The data‐driven approaches for medium‐range weather forecasting are recently shown to be extraordinarily promising for ensemble forecasting due to their fast inference speed compared to the traditional numerical weather prediction models. However, their forecast accuracy can hardly match the state‐of‐the‐art operational ECMWF Integrated Forecasting System (IFS) model. Previous data‐driven approaches perform ensemble forecasting using some simple perturbation methods, like the initial condition perturbation and the Monte Carlo dropout. However, their ensemble performance is often limited arguably by the sub‐optimal ways of applying perturbation. We propose a Swin Transformer‐based Variational Recurrent Neural Network (SwinVRNN), which is a stochastic weather forecasting model combining a SwinRNN predictor with a perturbation module. SwinRNN is designed as a Swin Transformer‐based recurrent neural network, which predicts the future states deterministically. Furthermore, to model the stochasticity in the prediction, we design a perturbation module following the Variational Auto‐Encoder paradigm to learn the multivariate Gaussian distributions of a time‐variant stochastic latent variable from the data. Ensemble forecasting can be easily performed by perturbing the model features leveraging the noise sampled from the learned distribution. We also compare four categories of perturbation methods for ensemble forecasting, that is, fixed distribution perturbation, learned distribution perturbation, MC dropout, and multi model ensemble. Comparisons on the WeatherBench data set show that the learned distribution perturbation method using our SwinVRNN model achieves remarkably improved forecasting accuracy and reasonable ensemble spread due to the joint optimization of the two targets. More notably, SwinVRNN surpasses operational IFS on the surface variables of the 2‐m temperature and the 6‐hourly total precipitation at all lead times up to 5 days (Code is available at https://github.com/tpys/wwprediction).https://doi.org/10.1029/2022MS003211medium‐range weather forecastingdata‐driven methodensemble forecastlearned distribution perturbation
spellingShingle Yuan Hu
Lei Chen
Zhibin Wang
Hao Li
SwinVRNN: A Data‐Driven Ensemble Forecasting Model via Learned Distribution Perturbation
Journal of Advances in Modeling Earth Systems
medium‐range weather forecasting
data‐driven method
ensemble forecast
learned distribution perturbation
title SwinVRNN: A Data‐Driven Ensemble Forecasting Model via Learned Distribution Perturbation
title_full SwinVRNN: A Data‐Driven Ensemble Forecasting Model via Learned Distribution Perturbation
title_fullStr SwinVRNN: A Data‐Driven Ensemble Forecasting Model via Learned Distribution Perturbation
title_full_unstemmed SwinVRNN: A Data‐Driven Ensemble Forecasting Model via Learned Distribution Perturbation
title_short SwinVRNN: A Data‐Driven Ensemble Forecasting Model via Learned Distribution Perturbation
title_sort swinvrnn a data driven ensemble forecasting model via learned distribution perturbation
topic medium‐range weather forecasting
data‐driven method
ensemble forecast
learned distribution perturbation
url https://doi.org/10.1029/2022MS003211
work_keys_str_mv AT yuanhu swinvrnnadatadrivenensembleforecastingmodelvialearneddistributionperturbation
AT leichen swinvrnnadatadrivenensembleforecastingmodelvialearneddistributionperturbation
AT zhibinwang swinvrnnadatadrivenensembleforecastingmodelvialearneddistributionperturbation
AT haoli swinvrnnadatadrivenensembleforecastingmodelvialearneddistributionperturbation