Anti-clustering in the national SARS-CoV-2 daily infection counts

The noise in daily infection counts of an epidemic should be super-Poissonian due to intrinsic epidemiological and administrative clustering. Here, we use this clustering to classify the official national SARS-CoV-2 daily infection counts and check for infection counts that are unusually anti-cluste...

Full description

Bibliographic Details
Main Author: Boudewijn F. Roukema
Format: Article
Language:English
Published: PeerJ Inc. 2021-08-01
Series:PeerJ
Subjects:
Online Access:https://peerj.com/articles/11856.pdf
_version_ 1827612383853412352
author Boudewijn F. Roukema
author_facet Boudewijn F. Roukema
author_sort Boudewijn F. Roukema
collection DOAJ
description The noise in daily infection counts of an epidemic should be super-Poissonian due to intrinsic epidemiological and administrative clustering. Here, we use this clustering to classify the official national SARS-CoV-2 daily infection counts and check for infection counts that are unusually anti-clustered. We adopt a one-parameter model of $\phi _i^{\prime}$ϕi′ infections per cluster, dividing any daily count ni into $n_i/ _i^{\prime}$ni/ϕi′ ‘clusters’, for ‘country’ i. We assume that ${n_i}/\phi _i^{\prime}$ni/ϕi′ on a given day j is drawn from a Poisson distribution whose mean is robustly estimated from the four neighbouring days, and calculate the inferred Poisson probability $P_{ij}^{\prime}$Pij′ of the observation. The $P_{ij}^{\prime}$Pij′ values should be uniformly distributed. We find the value $\phi_i$ϕi that minimises the Kolmogorov–Smirnov distance from a uniform distribution. We investigate the (ϕi, Ni) distribution, for total infection count Ni. We consider consecutive count sequences above a threshold of 50 daily infections. We find that most of the daily infection count sequences are inconsistent with a Poissonian model. Most are found to be consistent with the ϕi model. The 28-, 14- and 7-day least noisy sequences for several countries are best modelled as sub-Poissonian, suggesting a distinct epidemiological family. The 28-day least noisy sequence of Algeria has a preferred model that is strongly sub-Poissonian, with $\phi _i^{28} < 0.1$ϕi28<0.1 . Tajikistan, Turkey, Russia, Belarus, Albania, United Arab Emirates and Nicaragua have preferred models that are also sub-Poissonian, with $\phi _i^{28} < 0.5$ϕi28<0.5 . A statistically significant (Pτ < 0.05) correlation was found between the lack of media freedom in a country, as represented by a high Reporters sans frontieres Press Freedom Index (PFI2020), and the lack of statistical noise in the country’s daily counts. The ϕi model appears to be an effective detector of suspiciously low statistical noise in the national SARS-CoV-2 daily infection counts.
first_indexed 2024-03-09T08:20:39Z
format Article
id doaj.art-5f04a4e9707847be936e4aab50df9f37
institution Directory Open Access Journal
issn 2167-8359
language English
last_indexed 2024-03-09T08:20:39Z
publishDate 2021-08-01
publisher PeerJ Inc.
record_format Article
series PeerJ
spelling doaj.art-5f04a4e9707847be936e4aab50df9f372023-12-02T21:49:47ZengPeerJ Inc.PeerJ2167-83592021-08-019e1185610.7717/peerj.11856Anti-clustering in the national SARS-CoV-2 daily infection countsBoudewijn F. Roukema0Institute of Astronomy, Faculty of Physics, Astronomy and Informatics, ul. Grudziadzka 5, Nicolaus Copernicus University of Torun, Torun, PolandThe noise in daily infection counts of an epidemic should be super-Poissonian due to intrinsic epidemiological and administrative clustering. Here, we use this clustering to classify the official national SARS-CoV-2 daily infection counts and check for infection counts that are unusually anti-clustered. We adopt a one-parameter model of $\phi _i^{\prime}$ϕi′ infections per cluster, dividing any daily count ni into $n_i/ _i^{\prime}$ni/ϕi′ ‘clusters’, for ‘country’ i. We assume that ${n_i}/\phi _i^{\prime}$ni/ϕi′ on a given day j is drawn from a Poisson distribution whose mean is robustly estimated from the four neighbouring days, and calculate the inferred Poisson probability $P_{ij}^{\prime}$Pij′ of the observation. The $P_{ij}^{\prime}$Pij′ values should be uniformly distributed. We find the value $\phi_i$ϕi that minimises the Kolmogorov–Smirnov distance from a uniform distribution. We investigate the (ϕi, Ni) distribution, for total infection count Ni. We consider consecutive count sequences above a threshold of 50 daily infections. We find that most of the daily infection count sequences are inconsistent with a Poissonian model. Most are found to be consistent with the ϕi model. The 28-, 14- and 7-day least noisy sequences for several countries are best modelled as sub-Poissonian, suggesting a distinct epidemiological family. The 28-day least noisy sequence of Algeria has a preferred model that is strongly sub-Poissonian, with $\phi _i^{28} < 0.1$ϕi28<0.1 . Tajikistan, Turkey, Russia, Belarus, Albania, United Arab Emirates and Nicaragua have preferred models that are also sub-Poissonian, with $\phi _i^{28} < 0.5$ϕi28<0.5 . A statistically significant (Pτ < 0.05) correlation was found between the lack of media freedom in a country, as represented by a high Reporters sans frontieres Press Freedom Index (PFI2020), and the lack of statistical noise in the country’s daily counts. The ϕi model appears to be an effective detector of suspiciously low statistical noise in the national SARS-CoV-2 daily infection counts.https://peerj.com/articles/11856.pdfCOVID-19Data validationPoisson point processSARS-CoV-2
spellingShingle Boudewijn F. Roukema
Anti-clustering in the national SARS-CoV-2 daily infection counts
PeerJ
COVID-19
Data validation
Poisson point process
SARS-CoV-2
title Anti-clustering in the national SARS-CoV-2 daily infection counts
title_full Anti-clustering in the national SARS-CoV-2 daily infection counts
title_fullStr Anti-clustering in the national SARS-CoV-2 daily infection counts
title_full_unstemmed Anti-clustering in the national SARS-CoV-2 daily infection counts
title_short Anti-clustering in the national SARS-CoV-2 daily infection counts
title_sort anti clustering in the national sars cov 2 daily infection counts
topic COVID-19
Data validation
Poisson point process
SARS-CoV-2
url https://peerj.com/articles/11856.pdf
work_keys_str_mv AT boudewijnfroukema anticlusteringinthenationalsarscov2dailyinfectioncounts