High variability in model performance of Google relative search volumes in spatially clustered COVID-19 areas of the USA

ABSTRACT: Objective: Incorporating spatial analyses and online health information queries may be beneficial in understanding the role of Google relative search volume (RSV) data as a secondary public health surveillance tool during pandemics. This study identified coronavirus disease 2019 (COVID-19)...

Full description

Bibliographic Details
Main Authors: Atina Husnayain, Ting-Wu Chuang, Anis Fuad, Emily Chia-Yu Su
Format: Article
Language:English
Published: Elsevier 2021-08-01
Series:International Journal of Infectious Diseases
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1201971221005877
_version_ 1818681859432775680
author Atina Husnayain
Ting-Wu Chuang
Anis Fuad
Emily Chia-Yu Su
author_facet Atina Husnayain
Ting-Wu Chuang
Anis Fuad
Emily Chia-Yu Su
author_sort Atina Husnayain
collection DOAJ
description ABSTRACT: Objective: Incorporating spatial analyses and online health information queries may be beneficial in understanding the role of Google relative search volume (RSV) data as a secondary public health surveillance tool during pandemics. This study identified coronavirus disease 2019 (COVID-19) clustering and defined the predictability performance of Google RSV models in clustered and non-clustered areas of the USA.Methods: Getis-Ord General and local G statistics were used to identify monthly clustering patterns. Monthly country- and state-level correlations between new daily COVID-19 cases and Google RSVs were assessed using Spearman's rank correlation coefficients and Poisson regression models for January–December 2020.Results: Huge clusters involving multiple states were found, which resulted from various control measures in each state. This demonstrates the importance of state-to-state coordination in implementing control measures to tackle the spread of outbreaks. Variability in Google RSV model performance was found among states and time periods, possibly suggesting the need to use different frameworks for Google RSV data in each state. Moreover, the sign of correlation can be utilized to understand public responses to control and preventive measures, as well as in communicating risk.Conclusion: COVID-19 Google RSV model accuracy in the USA may be influenced by COVID-19 transmission dynamics, policy-driven community awareness and past outbreak experiences.
first_indexed 2024-12-17T10:09:39Z
format Article
id doaj.art-d76a95e4808a4c78b0904d9b3ef11c24
institution Directory Open Access Journal
issn 1201-9712
language English
last_indexed 2024-12-17T10:09:39Z
publishDate 2021-08-01
publisher Elsevier
record_format Article
series International Journal of Infectious Diseases
spelling doaj.art-d76a95e4808a4c78b0904d9b3ef11c242022-12-21T21:53:03ZengElsevierInternational Journal of Infectious Diseases1201-97122021-08-01109269278High variability in model performance of Google relative search volumes in spatially clustered COVID-19 areas of the USAAtina Husnayain0Ting-Wu Chuang1Anis Fuad2Emily Chia-Yu Su3Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan; Department of Biostatistics, Epidemiology and Population Health, Faculty of Medicine, Public Health and Nursing, Universitas Gadjah Mada, Yogyakarta, IndonesiaDepartment of Molecular Parasitology and Tropical Diseases, School of Medicine, College of Medicine, Taipei Medical University, Taipei, TaiwanDepartment of Biostatistics, Epidemiology and Population Health, Faculty of Medicine, Public Health and Nursing, Universitas Gadjah Mada, Yogyakarta, IndonesiaGraduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan; Clinical Big Data Research Centre, Taipei Medical University Hospital, Taipei, Taiwan; Corresponding author. Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, 172-1 Keelung Rd., Sec. 2, Taipei 106, Taiwan. Tel.: +886-2-66382736 ext. 1515.ABSTRACT: Objective: Incorporating spatial analyses and online health information queries may be beneficial in understanding the role of Google relative search volume (RSV) data as a secondary public health surveillance tool during pandemics. This study identified coronavirus disease 2019 (COVID-19) clustering and defined the predictability performance of Google RSV models in clustered and non-clustered areas of the USA.Methods: Getis-Ord General and local G statistics were used to identify monthly clustering patterns. Monthly country- and state-level correlations between new daily COVID-19 cases and Google RSVs were assessed using Spearman's rank correlation coefficients and Poisson regression models for January–December 2020.Results: Huge clusters involving multiple states were found, which resulted from various control measures in each state. This demonstrates the importance of state-to-state coordination in implementing control measures to tackle the spread of outbreaks. Variability in Google RSV model performance was found among states and time periods, possibly suggesting the need to use different frameworks for Google RSV data in each state. Moreover, the sign of correlation can be utilized to understand public responses to control and preventive measures, as well as in communicating risk.Conclusion: COVID-19 Google RSV model accuracy in the USA may be influenced by COVID-19 transmission dynamics, policy-driven community awareness and past outbreak experiences.http://www.sciencedirect.com/science/article/pii/S1201971221005877COVID-19United StatesSpatial analysisGoogle TrendsPredictability performanceInfodemiology
spellingShingle Atina Husnayain
Ting-Wu Chuang
Anis Fuad
Emily Chia-Yu Su
High variability in model performance of Google relative search volumes in spatially clustered COVID-19 areas of the USA
International Journal of Infectious Diseases
COVID-19
United States
Spatial analysis
Google Trends
Predictability performance
Infodemiology
title High variability in model performance of Google relative search volumes in spatially clustered COVID-19 areas of the USA
title_full High variability in model performance of Google relative search volumes in spatially clustered COVID-19 areas of the USA
title_fullStr High variability in model performance of Google relative search volumes in spatially clustered COVID-19 areas of the USA
title_full_unstemmed High variability in model performance of Google relative search volumes in spatially clustered COVID-19 areas of the USA
title_short High variability in model performance of Google relative search volumes in spatially clustered COVID-19 areas of the USA
title_sort high variability in model performance of google relative search volumes in spatially clustered covid 19 areas of the usa
topic COVID-19
United States
Spatial analysis
Google Trends
Predictability performance
Infodemiology
url http://www.sciencedirect.com/science/article/pii/S1201971221005877
work_keys_str_mv AT atinahusnayain highvariabilityinmodelperformanceofgooglerelativesearchvolumesinspatiallyclusteredcovid19areasoftheusa
AT tingwuchuang highvariabilityinmodelperformanceofgooglerelativesearchvolumesinspatiallyclusteredcovid19areasoftheusa
AT anisfuad highvariabilityinmodelperformanceofgooglerelativesearchvolumesinspatiallyclusteredcovid19areasoftheusa
AT emilychiayusu highvariabilityinmodelperformanceofgooglerelativesearchvolumesinspatiallyclusteredcovid19areasoftheusa