Using neural networks to improve simulations in the gray zone

<p>Machine learning represents a potential method to cope with the gray zone problem of representing motions in dynamical systems on scales comparable to the model resolution. Here we explore the possibility of using a neural network to directly learn the error caused by unresolved scales. We...

Full description

Bibliographic Details
Main Authors: R. Kriegmair, Y. Ruckstuhl, S. Rasp, G. Craig
Format: Article
Language:English
Published: Copernicus Publications 2022-05-01
Series:Nonlinear Processes in Geophysics
Online Access:https://npg.copernicus.org/articles/29/171/2022/npg-29-171-2022.pdf
_version_ 1818140755385909248
author R. Kriegmair
Y. Ruckstuhl
S. Rasp
G. Craig
author_facet R. Kriegmair
Y. Ruckstuhl
S. Rasp
G. Craig
author_sort R. Kriegmair
collection DOAJ
description <p>Machine learning represents a potential method to cope with the gray zone problem of representing motions in dynamical systems on scales comparable to the model resolution. Here we explore the possibility of using a neural network to directly learn the error caused by unresolved scales. We use a modified shallow water model which includes highly nonlinear processes mimicking atmospheric convection. To create the training dataset, we run the model in a high- and a low-resolution setup and compare the difference after one low-resolution time step, starting from the same initial conditions, thereby obtaining an exact target. The neural network is able to learn a large portion of the difference when evaluated on single time step predictions on a validation dataset. When coupled to the low-resolution model, we find large forecast improvements up to 1 d on average. After this, the accumulated error due to the mass conservation violation of the neural network starts to dominate and deteriorates the forecast. This deterioration can effectively be delayed by adding a penalty term to the loss function used to train the ANN to conserve mass in a weak sense. This study reinforces the need to include physical constraints in neural network parameterizations.</p>
first_indexed 2024-12-11T10:49:02Z
format Article
id doaj.art-1a0c93ccf49c42089fa94afda14339b5
institution Directory Open Access Journal
issn 1023-5809
1607-7946
language English
last_indexed 2024-12-11T10:49:02Z
publishDate 2022-05-01
publisher Copernicus Publications
record_format Article
series Nonlinear Processes in Geophysics
spelling doaj.art-1a0c93ccf49c42089fa94afda14339b52022-12-22T01:10:24ZengCopernicus PublicationsNonlinear Processes in Geophysics1023-58091607-79462022-05-012917118110.5194/npg-29-171-2022Using neural networks to improve simulations in the gray zoneR. Kriegmair0Y. Ruckstuhl1S. Rasp2G. Craig3Meteorological Institute Munich, Ludwig-Maximilians-Universität München, Munich, GermanyMeteorological Institute Munich, Ludwig-Maximilians-Universität München, Munich, GermanyClimateAi, Inc., San Francisco, USAMeteorological Institute Munich, Ludwig-Maximilians-Universität München, Munich, Germany<p>Machine learning represents a potential method to cope with the gray zone problem of representing motions in dynamical systems on scales comparable to the model resolution. Here we explore the possibility of using a neural network to directly learn the error caused by unresolved scales. We use a modified shallow water model which includes highly nonlinear processes mimicking atmospheric convection. To create the training dataset, we run the model in a high- and a low-resolution setup and compare the difference after one low-resolution time step, starting from the same initial conditions, thereby obtaining an exact target. The neural network is able to learn a large portion of the difference when evaluated on single time step predictions on a validation dataset. When coupled to the low-resolution model, we find large forecast improvements up to 1 d on average. After this, the accumulated error due to the mass conservation violation of the neural network starts to dominate and deteriorates the forecast. This deterioration can effectively be delayed by adding a penalty term to the loss function used to train the ANN to conserve mass in a weak sense. This study reinforces the need to include physical constraints in neural network parameterizations.</p>https://npg.copernicus.org/articles/29/171/2022/npg-29-171-2022.pdf
spellingShingle R. Kriegmair
Y. Ruckstuhl
S. Rasp
G. Craig
Using neural networks to improve simulations in the gray zone
Nonlinear Processes in Geophysics
title Using neural networks to improve simulations in the gray zone
title_full Using neural networks to improve simulations in the gray zone
title_fullStr Using neural networks to improve simulations in the gray zone
title_full_unstemmed Using neural networks to improve simulations in the gray zone
title_short Using neural networks to improve simulations in the gray zone
title_sort using neural networks to improve simulations in the gray zone
url https://npg.copernicus.org/articles/29/171/2022/npg-29-171-2022.pdf
work_keys_str_mv AT rkriegmair usingneuralnetworkstoimprovesimulationsinthegrayzone
AT yruckstuhl usingneuralnetworkstoimprovesimulationsinthegrayzone
AT srasp usingneuralnetworkstoimprovesimulationsinthegrayzone
AT gcraig usingneuralnetworkstoimprovesimulationsinthegrayzone