A Novel Transformer Network With Shifted Window Cross-Attention for Spatiotemporal Weather Forecasting

Earth observation is a growing research area that can capitalize on the powers of artificial intelligence for short time forecasting, a now-casting scenario. In this work, we tackle the challenge of weather forecasting using a video transformer network. Vision transformer architectures have been exp...

Full description

Bibliographic Details
Main Authors: Alabi Bojesomo, Hasan AlMarzouqi, Panos Liatsis
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10285372/
_version_ 1797448891324432384
author Alabi Bojesomo
Hasan AlMarzouqi
Panos Liatsis
author_facet Alabi Bojesomo
Hasan AlMarzouqi
Panos Liatsis
author_sort Alabi Bojesomo
collection DOAJ
description Earth observation is a growing research area that can capitalize on the powers of artificial intelligence for short time forecasting, a now-casting scenario. In this work, we tackle the challenge of weather forecasting using a video transformer network. Vision transformer architectures have been explored in various applications, with major constraints being the computational complexity of attention and the data-hungry training. To address these issues, we propose the use of video Swin-transformer (VST), coupled with a dedicated augmentation scheme. Moreover, we employ gradual spatial reduction on the encoder side and cross-attention on the decoder. The proposed approach is tested on the Weather4Cast2021 weather forecasting challenge data, which requires the prediction of 8 h ahead future frames (4 per hour) from an hourly weather product sequence. The dataset was normalized to 0–1 to facilitate the use of the evaluation metrics across different datasets. The model results in an mse score of 0.4750 when provided with training data, and 0.4420 during transfer learning without using training data, respectively.
first_indexed 2024-03-09T14:16:58Z
format Article
id doaj.art-9c6870079e214d79b2ccdaa8cc943bfd
institution Directory Open Access Journal
issn 2151-1535
language English
last_indexed 2024-03-09T14:16:58Z
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
spelling doaj.art-9c6870079e214d79b2ccdaa8cc943bfd2023-11-29T00:00:54ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352024-01-0117455510.1109/JSTARS.2023.332372910285372A Novel Transformer Network With Shifted Window Cross-Attention for Spatiotemporal Weather ForecastingAlabi Bojesomo0https://orcid.org/0000-0002-4685-148XHasan AlMarzouqi1https://orcid.org/0000-0002-2826-1515Panos Liatsis2https://orcid.org/0000-0002-5490-6030Department of Electrical Engineering and Computer Sciences, Khalifa University, Abu Dhabi, United Arab EmiratesDepartment of Electrical Engineering and Computer Sciences, Khalifa University, Abu Dhabi, United Arab EmiratesDepartment of Electrical Engineering and Computer Sciences, Khalifa University, Abu Dhabi, United Arab EmiratesEarth observation is a growing research area that can capitalize on the powers of artificial intelligence for short time forecasting, a now-casting scenario. In this work, we tackle the challenge of weather forecasting using a video transformer network. Vision transformer architectures have been explored in various applications, with major constraints being the computational complexity of attention and the data-hungry training. To address these issues, we propose the use of video Swin-transformer (VST), coupled with a dedicated augmentation scheme. Moreover, we employ gradual spatial reduction on the encoder side and cross-attention on the decoder. The proposed approach is tested on the Weather4Cast2021 weather forecasting challenge data, which requires the prediction of 8 h ahead future frames (4 per hour) from an hourly weather product sequence. The dataset was normalized to 0–1 to facilitate the use of the evaluation metrics across different datasets. The model results in an mse score of 0.4750 when provided with training data, and 0.4420 during transfer learning without using training data, respectively.https://ieeexplore.ieee.org/document/10285372/Encoder–decoder video architecturenow-castingshifted window cross attentionvideo Swin-transformer (VST)weather forecasting
spellingShingle Alabi Bojesomo
Hasan AlMarzouqi
Panos Liatsis
A Novel Transformer Network With Shifted Window Cross-Attention for Spatiotemporal Weather Forecasting
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Encoder–decoder video architecture
now-casting
shifted window cross attention
video Swin-transformer (VST)
weather forecasting
title A Novel Transformer Network With Shifted Window Cross-Attention for Spatiotemporal Weather Forecasting
title_full A Novel Transformer Network With Shifted Window Cross-Attention for Spatiotemporal Weather Forecasting
title_fullStr A Novel Transformer Network With Shifted Window Cross-Attention for Spatiotemporal Weather Forecasting
title_full_unstemmed A Novel Transformer Network With Shifted Window Cross-Attention for Spatiotemporal Weather Forecasting
title_short A Novel Transformer Network With Shifted Window Cross-Attention for Spatiotemporal Weather Forecasting
title_sort novel transformer network with shifted window cross attention for spatiotemporal weather forecasting
topic Encoder–decoder video architecture
now-casting
shifted window cross attention
video Swin-transformer (VST)
weather forecasting
url https://ieeexplore.ieee.org/document/10285372/
work_keys_str_mv AT alabibojesomo anoveltransformernetworkwithshiftedwindowcrossattentionforspatiotemporalweatherforecasting
AT hasanalmarzouqi anoveltransformernetworkwithshiftedwindowcrossattentionforspatiotemporalweatherforecasting
AT panosliatsis anoveltransformernetworkwithshiftedwindowcrossattentionforspatiotemporalweatherforecasting
AT alabibojesomo noveltransformernetworkwithshiftedwindowcrossattentionforspatiotemporalweatherforecasting
AT hasanalmarzouqi noveltransformernetworkwithshiftedwindowcrossattentionforspatiotemporalweatherforecasting
AT panosliatsis noveltransformernetworkwithshiftedwindowcrossattentionforspatiotemporalweatherforecasting