How far can the statistical error estimation problem be closed by collocated data?

<p>Accurate specification of the error statistics required for data assimilation remains an ongoing challenge, partly because their estimation is an underdetermined problem that requires statistical assumptions. Even with the common assumption that background and observation errors are uncorre...

Full description

Bibliographic Details
Main Authors: A. Vogel, R. Ménard
Format: Article
Language:English
Published: Copernicus Publications 2023-09-01
Series:Nonlinear Processes in Geophysics
Online Access:https://npg.copernicus.org/articles/30/375/2023/npg-30-375-2023.pdf
_version_ 1797681430149464064
author A. Vogel
A. Vogel
R. Ménard
author_facet A. Vogel
A. Vogel
R. Ménard
author_sort A. Vogel
collection DOAJ
description <p>Accurate specification of the error statistics required for data assimilation remains an ongoing challenge, partly because their estimation is an underdetermined problem that requires statistical assumptions. Even with the common assumption that background and observation errors are uncorrelated, the problem remains underdetermined. One natural question that could arise is as follows: can the increasing amount of overlapping observations or other datasets help to reduce the total number of statistical assumptions, or do they introduce more statistical unknowns? In order to answer this question, this paper provides a conceptual view on the statistical error estimation problem for multiple collocated datasets, including a generalized mathematical formulation, an illustrative demonstration with synthetic data, and guidelines for setting up and solving the problem. It is demonstrated that the required number of statistical assumptions increases linearly with the number of datasets. However, the number of error statistics that can be estimated increases quadratically, allowing for an estimation of an increasing number of error cross-statistics between datasets for more than three datasets. The presented generalized estimation of full error covariance and cross-covariance matrices between datasets does not necessarily accumulate the uncertainties of assumptions among error estimations of multiple datasets.</p>
first_indexed 2024-03-11T23:44:43Z
format Article
id doaj.art-274716b7d7384c3e8455e78864596981
institution Directory Open Access Journal
issn 1023-5809
1607-7946
language English
last_indexed 2024-03-11T23:44:43Z
publishDate 2023-09-01
publisher Copernicus Publications
record_format Article
series Nonlinear Processes in Geophysics
spelling doaj.art-274716b7d7384c3e8455e788645969812023-09-19T10:24:53ZengCopernicus PublicationsNonlinear Processes in Geophysics1023-58091607-79462023-09-013037539810.5194/npg-30-375-2023How far can the statistical error estimation problem be closed by collocated data?A. Vogel0A. Vogel1R. Ménard2Air Quality Research Division, Environment and Climate Change Canada (ECCC), Dorval, Quebec, CanadaRhenish Institute for Environmental Research at the University of Cologne (RIU), Cologne, GermanyAir Quality Research Division, Environment and Climate Change Canada (ECCC), Dorval, Quebec, Canada<p>Accurate specification of the error statistics required for data assimilation remains an ongoing challenge, partly because their estimation is an underdetermined problem that requires statistical assumptions. Even with the common assumption that background and observation errors are uncorrelated, the problem remains underdetermined. One natural question that could arise is as follows: can the increasing amount of overlapping observations or other datasets help to reduce the total number of statistical assumptions, or do they introduce more statistical unknowns? In order to answer this question, this paper provides a conceptual view on the statistical error estimation problem for multiple collocated datasets, including a generalized mathematical formulation, an illustrative demonstration with synthetic data, and guidelines for setting up and solving the problem. It is demonstrated that the required number of statistical assumptions increases linearly with the number of datasets. However, the number of error statistics that can be estimated increases quadratically, allowing for an estimation of an increasing number of error cross-statistics between datasets for more than three datasets. The presented generalized estimation of full error covariance and cross-covariance matrices between datasets does not necessarily accumulate the uncertainties of assumptions among error estimations of multiple datasets.</p>https://npg.copernicus.org/articles/30/375/2023/npg-30-375-2023.pdf
spellingShingle A. Vogel
A. Vogel
R. Ménard
How far can the statistical error estimation problem be closed by collocated data?
Nonlinear Processes in Geophysics
title How far can the statistical error estimation problem be closed by collocated data?
title_full How far can the statistical error estimation problem be closed by collocated data?
title_fullStr How far can the statistical error estimation problem be closed by collocated data?
title_full_unstemmed How far can the statistical error estimation problem be closed by collocated data?
title_short How far can the statistical error estimation problem be closed by collocated data?
title_sort how far can the statistical error estimation problem be closed by collocated data
url https://npg.copernicus.org/articles/30/375/2023/npg-30-375-2023.pdf
work_keys_str_mv AT avogel howfarcanthestatisticalerrorestimationproblembeclosedbycollocateddata
AT avogel howfarcanthestatisticalerrorestimationproblembeclosedbycollocateddata
AT rmenard howfarcanthestatisticalerrorestimationproblembeclosedbycollocateddata