An ensemble-based statistical methodology to detect differences in weather and climate model executables

<p>Since their first operational application in the 1950s, atmospheric numerical models have become essential tools in weather prediction and climate research. As such, they are subject to continuous changes, thanks to advances in computer systems, numerical methods, more and better observatio...

Full description

Bibliographic Details
Main Authors:	C. Zeman, C. Schär
Format:	Article
Language:	English
Published:	Copernicus Publications 2022-04-01
Series:	Geoscientific Model Development
Online Access:	https://gmd.copernicus.org/articles/15/3183/2022/gmd-15-3183-2022.pdf

_version_	1818051107044196352
author	C. Zeman C. Schär
author_facet	C. Zeman C. Schär
author_sort	C. Zeman
collection	DOAJ
description	<p>Since their first operational application in the 1950s, atmospheric numerical models have become essential tools in weather prediction and climate research. As such, they are subject to continuous changes, thanks to advances in computer systems, numerical methods, more and better observations, and the ever-increasing knowledge about the atmosphere of earth. Many of the changes in today's models relate to seemingly innocuous modifications associated with minor code rearrangements, changes in hardware infrastructure, or software updates. Such changes are meant to preserve the model formulation, yet the verification of such changes is challenged by the chaotic nature of our atmosphere – any small change, even rounding errors, can have a significant impact on individual simulations. Overall, this represents a serious challenge to a consistent model development and maintenance framework.</p> <p>Here we propose a new methodology for quantifying and verifying the impacts of minor changes in the atmospheric model or its underlying hardware/software system by using ensemble simulations in combination with a statistical hypothesis test for instantaneous or hourly values of output variables at the grid-cell level. The methodology can assess the effects of model changes on almost any output variable over time and can be used with different underlying statistical hypothesis tests.</p> <p>We present the first applications of the methodology with the regional weather and climate model COSMO. While providing very robust results, the methodology shows a great sensitivity even to very small changes. Specific changes considered include applying a tiny amount of explicit diffusion, the switch from double to single precision, and a major system update of the underlying supercomputer. Results show that changes are often only detectable during the first hours, suggesting that short-term ensemble simulations (days to months) are best suited for the methodology, even when addressing long-term climate simulations. Furthermore, we show that spatial averaging – as opposed to testing at all grid points – reduces the test's sensitivity for small-scale features such as diffusion. We also show that the choice of the underlying statistical hypothesis test is not essential and that the methodology already works well for coarse resolutions, making it computationally inexpensive and therefore an ideal candidate for automated testing.</p>
first_indexed	2024-12-10T11:04:06Z
format	Article
id	doaj.art-bec5a3327c3d4c2bbf66c9f016989812
institution	Directory Open Access Journal
issn	1991-959X 1991-9603
language	English
last_indexed	2024-12-10T11:04:06Z
publishDate	2022-04-01
publisher	Copernicus Publications
record_format	Article
series	Geoscientific Model Development
spelling	doaj.art-bec5a3327c3d4c2bbf66c9f0169898122022-12-22T01:51:37ZengCopernicus PublicationsGeoscientific Model Development1991-959X1991-96032022-04-01153183320310.5194/gmd-15-3183-2022An ensemble-based statistical methodology to detect differences in weather and climate model executablesC. ZemanC. Schär<p>Since their first operational application in the 1950s, atmospheric numerical models have become essential tools in weather prediction and climate research. As such, they are subject to continuous changes, thanks to advances in computer systems, numerical methods, more and better observations, and the ever-increasing knowledge about the atmosphere of earth. Many of the changes in today's models relate to seemingly innocuous modifications associated with minor code rearrangements, changes in hardware infrastructure, or software updates. Such changes are meant to preserve the model formulation, yet the verification of such changes is challenged by the chaotic nature of our atmosphere – any small change, even rounding errors, can have a significant impact on individual simulations. Overall, this represents a serious challenge to a consistent model development and maintenance framework.</p> <p>Here we propose a new methodology for quantifying and verifying the impacts of minor changes in the atmospheric model or its underlying hardware/software system by using ensemble simulations in combination with a statistical hypothesis test for instantaneous or hourly values of output variables at the grid-cell level. The methodology can assess the effects of model changes on almost any output variable over time and can be used with different underlying statistical hypothesis tests.</p> <p>We present the first applications of the methodology with the regional weather and climate model COSMO. While providing very robust results, the methodology shows a great sensitivity even to very small changes. Specific changes considered include applying a tiny amount of explicit diffusion, the switch from double to single precision, and a major system update of the underlying supercomputer. Results show that changes are often only detectable during the first hours, suggesting that short-term ensemble simulations (days to months) are best suited for the methodology, even when addressing long-term climate simulations. Furthermore, we show that spatial averaging – as opposed to testing at all grid points – reduces the test's sensitivity for small-scale features such as diffusion. We also show that the choice of the underlying statistical hypothesis test is not essential and that the methodology already works well for coarse resolutions, making it computationally inexpensive and therefore an ideal candidate for automated testing.</p>https://gmd.copernicus.org/articles/15/3183/2022/gmd-15-3183-2022.pdf
spellingShingle	C. Zeman C. Schär An ensemble-based statistical methodology to detect differences in weather and climate model executables Geoscientific Model Development
title	An ensemble-based statistical methodology to detect differences in weather and climate model executables
title_full	An ensemble-based statistical methodology to detect differences in weather and climate model executables
title_fullStr	An ensemble-based statistical methodology to detect differences in weather and climate model executables
title_full_unstemmed	An ensemble-based statistical methodology to detect differences in weather and climate model executables
title_short	An ensemble-based statistical methodology to detect differences in weather and climate model executables
title_sort	ensemble based statistical methodology to detect differences in weather and climate model executables
url	https://gmd.copernicus.org/articles/15/3183/2022/gmd-15-3183-2022.pdf
work_keys_str_mv	AT czeman anensemblebasedstatisticalmethodologytodetectdifferencesinweatherandclimatemodelexecutables AT cschar anensemblebasedstatisticalmethodologytodetectdifferencesinweatherandclimatemodelexecutables AT czeman ensemblebasedstatisticalmethodologytodetectdifferencesinweatherandclimatemodelexecutables AT cschar ensemblebasedstatisticalmethodologytodetectdifferencesinweatherandclimatemodelexecutables

An ensemble-based statistical methodology to detect differences in weather and climate model executables

Similar Items