Perspectives of the COVID-19 Pandemic on Reddit: Comparative Natural Language Processing Study of the United States, the United Kingdom, Canada, and Australia

BackgroundSince COVID-19 was declared a pandemic by the World Health Organization on March 11, 2020, the disease has had an unprecedented impact worldwide. Social media such as Reddit can serve as a resource for enhancing situational awareness, particularly regarding monitori...

Full description

Bibliographic Details
Main Authors: Mengke Hu, Mike Conway
Format: Article
Language:English
Published: JMIR Publications 2022-09-01
Series:JMIR Infodemiology
Online Access:https://infodemiology.jmir.org/2022/2/e36941
_version_ 1797734740294369280
author Mengke Hu
Mike Conway
author_facet Mengke Hu
Mike Conway
author_sort Mengke Hu
collection DOAJ
description BackgroundSince COVID-19 was declared a pandemic by the World Health Organization on March 11, 2020, the disease has had an unprecedented impact worldwide. Social media such as Reddit can serve as a resource for enhancing situational awareness, particularly regarding monitoring public attitudes and behavior during the crisis. Insights gained can then be utilized to better understand public attitudes and behaviors during the COVID-19 crisis, and to support communication and health-promotion messaging. ObjectiveThe aim of this study was to compare public attitudes toward the 2020-2021 COVID-19 pandemic across four predominantly English-speaking countries (the United States, the United Kingdom, Canada, and Australia) using data derived from the social media platform Reddit. MethodsWe utilized a topic modeling natural language processing method (more specifically latent Dirichlet allocation). Topic modeling is a popular unsupervised learning technique that can be used to automatically infer topics (ie, semantically related categories) from a large corpus of text. We derived our data from six country-specific, COVID-19–related subreddits (r/CoronavirusAustralia, r/CoronavirusDownunder, r/CoronavirusCanada, r/CanadaCoronavirus, r/CoronavirusUK, and r/coronavirusus). We used topic modeling methods to investigate and compare topics of concern for each country. ResultsOur consolidated Reddit data set consisted of 84,229 initiating posts and 1,094,853 associated comments collected between February and November 2020 for the United States, the United Kingdom, Canada, and Australia. The volume of posting in COVID-19–related subreddits declined consistently across all four countries during the study period (February 2020 to November 2020). During lockdown events, the volume of posts peaked. The UK and Australian subreddits contained much more evidence-based policy discussion than the US or Canadian subreddits. ConclusionsThis study provides evidence to support the contention that there are key differences between salient topics discussed across the four countries on the Reddit platform. Further, our approach indicates that Reddit data have the potential to provide insights not readily apparent in survey-based approaches.
first_indexed 2024-03-12T12:48:52Z
format Article
id doaj.art-bbf0f373b70e4e619a2e7b8842e97c2e
institution Directory Open Access Journal
issn 2564-1891
language English
last_indexed 2024-03-12T12:48:52Z
publishDate 2022-09-01
publisher JMIR Publications
record_format Article
series JMIR Infodemiology
spelling doaj.art-bbf0f373b70e4e619a2e7b8842e97c2e2023-08-28T23:08:22ZengJMIR PublicationsJMIR Infodemiology2564-18912022-09-0122e3694110.2196/36941Perspectives of the COVID-19 Pandemic on Reddit: Comparative Natural Language Processing Study of the United States, the United Kingdom, Canada, and AustraliaMengke Huhttps://orcid.org/0000-0001-9421-6432Mike Conwayhttps://orcid.org/0000-0002-3209-8108 BackgroundSince COVID-19 was declared a pandemic by the World Health Organization on March 11, 2020, the disease has had an unprecedented impact worldwide. Social media such as Reddit can serve as a resource for enhancing situational awareness, particularly regarding monitoring public attitudes and behavior during the crisis. Insights gained can then be utilized to better understand public attitudes and behaviors during the COVID-19 crisis, and to support communication and health-promotion messaging. ObjectiveThe aim of this study was to compare public attitudes toward the 2020-2021 COVID-19 pandemic across four predominantly English-speaking countries (the United States, the United Kingdom, Canada, and Australia) using data derived from the social media platform Reddit. MethodsWe utilized a topic modeling natural language processing method (more specifically latent Dirichlet allocation). Topic modeling is a popular unsupervised learning technique that can be used to automatically infer topics (ie, semantically related categories) from a large corpus of text. We derived our data from six country-specific, COVID-19–related subreddits (r/CoronavirusAustralia, r/CoronavirusDownunder, r/CoronavirusCanada, r/CanadaCoronavirus, r/CoronavirusUK, and r/coronavirusus). We used topic modeling methods to investigate and compare topics of concern for each country. ResultsOur consolidated Reddit data set consisted of 84,229 initiating posts and 1,094,853 associated comments collected between February and November 2020 for the United States, the United Kingdom, Canada, and Australia. The volume of posting in COVID-19–related subreddits declined consistently across all four countries during the study period (February 2020 to November 2020). During lockdown events, the volume of posts peaked. The UK and Australian subreddits contained much more evidence-based policy discussion than the US or Canadian subreddits. ConclusionsThis study provides evidence to support the contention that there are key differences between salient topics discussed across the four countries on the Reddit platform. Further, our approach indicates that Reddit data have the potential to provide insights not readily apparent in survey-based approaches.https://infodemiology.jmir.org/2022/2/e36941
spellingShingle Mengke Hu
Mike Conway
Perspectives of the COVID-19 Pandemic on Reddit: Comparative Natural Language Processing Study of the United States, the United Kingdom, Canada, and Australia
JMIR Infodemiology
title Perspectives of the COVID-19 Pandemic on Reddit: Comparative Natural Language Processing Study of the United States, the United Kingdom, Canada, and Australia
title_full Perspectives of the COVID-19 Pandemic on Reddit: Comparative Natural Language Processing Study of the United States, the United Kingdom, Canada, and Australia
title_fullStr Perspectives of the COVID-19 Pandemic on Reddit: Comparative Natural Language Processing Study of the United States, the United Kingdom, Canada, and Australia
title_full_unstemmed Perspectives of the COVID-19 Pandemic on Reddit: Comparative Natural Language Processing Study of the United States, the United Kingdom, Canada, and Australia
title_short Perspectives of the COVID-19 Pandemic on Reddit: Comparative Natural Language Processing Study of the United States, the United Kingdom, Canada, and Australia
title_sort perspectives of the covid 19 pandemic on reddit comparative natural language processing study of the united states the united kingdom canada and australia
url https://infodemiology.jmir.org/2022/2/e36941
work_keys_str_mv AT mengkehu perspectivesofthecovid19pandemiconredditcomparativenaturallanguageprocessingstudyoftheunitedstatestheunitedkingdomcanadaandaustralia
AT mikeconway perspectivesofthecovid19pandemiconredditcomparativenaturallanguageprocessingstudyoftheunitedstatestheunitedkingdomcanadaandaustralia