An ensemble-based statistical methodology to detect differences in weather and climate model executables

<p>Since their first operational application in the 1950s, atmospheric numerical models have become essential tools in weather prediction and climate research. As such, they are subject to continuous changes, thanks to advances in computer systems, numerical methods, more and better observatio...

Full description

Bibliographic Details
Main Authors: C. Zeman, C. Schär
Format: Article
Language:English
Published: Copernicus Publications 2022-04-01
Series:Geoscientific Model Development
Online Access:https://gmd.copernicus.org/articles/15/3183/2022/gmd-15-3183-2022.pdf
_version_ 1818051107044196352
author C. Zeman
C. Schär
author_facet C. Zeman
C. Schär
author_sort C. Zeman
collection DOAJ
description <p>Since their first operational application in the 1950s, atmospheric numerical models have become essential tools in weather prediction and climate research. As such, they are subject to continuous changes, thanks to advances in computer systems, numerical methods, more and better observations, and the ever-increasing knowledge about the atmosphere of earth. Many of the changes in today's models relate to seemingly innocuous modifications associated with minor code rearrangements, changes in hardware infrastructure, or software updates. Such changes are meant to preserve the model formulation, yet the verification of such changes is challenged by the chaotic nature of our atmosphere – any small change, even rounding errors, can have a significant impact on individual simulations. Overall, this represents a serious challenge to a consistent model development and maintenance framework.</p> <p>Here we propose a new methodology for quantifying and verifying the impacts of minor changes in the atmospheric model or its underlying hardware/software system by using ensemble simulations in combination with a statistical hypothesis test for instantaneous or hourly values of output variables at the grid-cell level. The methodology can assess the effects of model changes on almost any output variable over time and can be used with different underlying statistical hypothesis tests.</p> <p>We present the first applications of the methodology with the regional weather and climate model COSMO. While providing very robust results, the methodology shows a great sensitivity even to very small changes. Specific changes considered include applying a tiny amount of explicit diffusion, the switch from double to single precision, and a major system update of the underlying supercomputer. Results show that changes are often only detectable during the first hours, suggesting that short-term ensemble simulations (days to months) are best suited for the methodology, even when addressing long-term climate simulations. Furthermore, we show that spatial averaging – as opposed to testing at all grid points – reduces the test's sensitivity for small-scale features such as diffusion. We also show that the choice of the underlying statistical hypothesis test is not essential and that the methodology already works well for coarse resolutions, making it computationally inexpensive and therefore an ideal candidate for automated testing.</p>
first_indexed 2024-12-10T11:04:06Z
format Article
id doaj.art-bec5a3327c3d4c2bbf66c9f016989812
institution Directory Open Access Journal
issn 1991-959X
1991-9603
language English
last_indexed 2024-12-10T11:04:06Z
publishDate 2022-04-01
publisher Copernicus Publications
record_format Article
series Geoscientific Model Development
spelling doaj.art-bec5a3327c3d4c2bbf66c9f0169898122022-12-22T01:51:37ZengCopernicus PublicationsGeoscientific Model Development1991-959X1991-96032022-04-01153183320310.5194/gmd-15-3183-2022An ensemble-based statistical methodology to detect differences in weather and climate model executablesC. ZemanC. Schär<p>Since their first operational application in the 1950s, atmospheric numerical models have become essential tools in weather prediction and climate research. As such, they are subject to continuous changes, thanks to advances in computer systems, numerical methods, more and better observations, and the ever-increasing knowledge about the atmosphere of earth. Many of the changes in today's models relate to seemingly innocuous modifications associated with minor code rearrangements, changes in hardware infrastructure, or software updates. Such changes are meant to preserve the model formulation, yet the verification of such changes is challenged by the chaotic nature of our atmosphere – any small change, even rounding errors, can have a significant impact on individual simulations. Overall, this represents a serious challenge to a consistent model development and maintenance framework.</p> <p>Here we propose a new methodology for quantifying and verifying the impacts of minor changes in the atmospheric model or its underlying hardware/software system by using ensemble simulations in combination with a statistical hypothesis test for instantaneous or hourly values of output variables at the grid-cell level. The methodology can assess the effects of model changes on almost any output variable over time and can be used with different underlying statistical hypothesis tests.</p> <p>We present the first applications of the methodology with the regional weather and climate model COSMO. While providing very robust results, the methodology shows a great sensitivity even to very small changes. Specific changes considered include applying a tiny amount of explicit diffusion, the switch from double to single precision, and a major system update of the underlying supercomputer. Results show that changes are often only detectable during the first hours, suggesting that short-term ensemble simulations (days to months) are best suited for the methodology, even when addressing long-term climate simulations. Furthermore, we show that spatial averaging – as opposed to testing at all grid points – reduces the test's sensitivity for small-scale features such as diffusion. We also show that the choice of the underlying statistical hypothesis test is not essential and that the methodology already works well for coarse resolutions, making it computationally inexpensive and therefore an ideal candidate for automated testing.</p>https://gmd.copernicus.org/articles/15/3183/2022/gmd-15-3183-2022.pdf
spellingShingle C. Zeman
C. Schär
An ensemble-based statistical methodology to detect differences in weather and climate model executables
Geoscientific Model Development
title An ensemble-based statistical methodology to detect differences in weather and climate model executables
title_full An ensemble-based statistical methodology to detect differences in weather and climate model executables
title_fullStr An ensemble-based statistical methodology to detect differences in weather and climate model executables
title_full_unstemmed An ensemble-based statistical methodology to detect differences in weather and climate model executables
title_short An ensemble-based statistical methodology to detect differences in weather and climate model executables
title_sort ensemble based statistical methodology to detect differences in weather and climate model executables
url https://gmd.copernicus.org/articles/15/3183/2022/gmd-15-3183-2022.pdf
work_keys_str_mv AT czeman anensemblebasedstatisticalmethodologytodetectdifferencesinweatherandclimatemodelexecutables
AT cschar anensemblebasedstatisticalmethodologytodetectdifferencesinweatherandclimatemodelexecutables
AT czeman ensemblebasedstatisticalmethodologytodetectdifferencesinweatherandclimatemodelexecutables
AT cschar ensemblebasedstatisticalmethodologytodetectdifferencesinweatherandclimatemodelexecutables