A Geo-Tagged COVID-19 Twitter Dataset for 10 North American Metropolitan Areas over a 255-Day Period

One of the unfortunate findings from the ongoing COVID-19 crisis is the disproportionate impact the crisis has had on people and communities who were already socioeconomically disadvantaged. It has, however, been difficult to study this issue at scale and in greater detail using social media platfor...

Full description

Bibliographic Details
Main Authors: Sara Melotte, Mayank Kejriwal
Format: Article
Language:English
Published: MDPI AG 2021-06-01
Series:Data
Subjects:
Online Access:https://www.mdpi.com/2306-5729/6/6/64
_version_ 1797530010221805568
author Sara Melotte
Mayank Kejriwal
author_facet Sara Melotte
Mayank Kejriwal
author_sort Sara Melotte
collection DOAJ
description One of the unfortunate findings from the ongoing COVID-19 crisis is the disproportionate impact the crisis has had on people and communities who were already socioeconomically disadvantaged. It has, however, been difficult to study this issue at scale and in greater detail using social media platforms like Twitter. Several COVID-19 Twitter datasets have been released, but they have very broad scope, both topically and geographically. In this paper, we present a more controlled and compact dataset that can be used to answer a range of potential research questions (especially pertaining to computational social science) without requiring extensive preprocessing or tweet-hydration from the earlier datasets. The proposed dataset comprises tens of thousands of geotagged (and in many cases, reverse-geocoded) tweets originally collected over a 255-day period in 2020 over 10 metropolitan areas in North America. Since there are socioeconomic disparities within these cities (sometimes to an extreme extent, as witnessed in ‘inner city neighborhoods’ in some of these cities), the dataset can be used to assess such socioeconomic disparities from a social media lens, in addition to comparing and contrasting behavior across cities.
first_indexed 2024-03-10T10:21:48Z
format Article
id doaj.art-c2b46dcd2d644fb8a299fd27ae7adcdb
institution Directory Open Access Journal
issn 2306-5729
language English
last_indexed 2024-03-10T10:21:48Z
publishDate 2021-06-01
publisher MDPI AG
record_format Article
series Data
spelling doaj.art-c2b46dcd2d644fb8a299fd27ae7adcdb2023-11-22T00:21:37ZengMDPI AGData2306-57292021-06-01666410.3390/data6060064A Geo-Tagged COVID-19 Twitter Dataset for 10 North American Metropolitan Areas over a 255-Day PeriodSara Melotte0Mayank Kejriwal1Information Sciences Institute, University of Southern California, 4676 Admiralty Way, Suite 1001, Marina del Rey, CA 90292, USAInformation Sciences Institute, University of Southern California, 4676 Admiralty Way, Suite 1001, Marina del Rey, CA 90292, USAOne of the unfortunate findings from the ongoing COVID-19 crisis is the disproportionate impact the crisis has had on people and communities who were already socioeconomically disadvantaged. It has, however, been difficult to study this issue at scale and in greater detail using social media platforms like Twitter. Several COVID-19 Twitter datasets have been released, but they have very broad scope, both topically and geographically. In this paper, we present a more controlled and compact dataset that can be used to answer a range of potential research questions (especially pertaining to computational social science) without requiring extensive preprocessing or tweet-hydration from the earlier datasets. The proposed dataset comprises tens of thousands of geotagged (and in many cases, reverse-geocoded) tweets originally collected over a 255-day period in 2020 over 10 metropolitan areas in North America. Since there are socioeconomic disparities within these cities (sometimes to an extreme extent, as witnessed in ‘inner city neighborhoods’ in some of these cities), the dataset can be used to assess such socioeconomic disparities from a social media lens, in addition to comparing and contrasting behavior across cities.https://www.mdpi.com/2306-5729/6/6/64COVID-19Twittergeo-taggedmetropolitancomputational social science
spellingShingle Sara Melotte
Mayank Kejriwal
A Geo-Tagged COVID-19 Twitter Dataset for 10 North American Metropolitan Areas over a 255-Day Period
Data
COVID-19
Twitter
geo-tagged
metropolitan
computational social science
title A Geo-Tagged COVID-19 Twitter Dataset for 10 North American Metropolitan Areas over a 255-Day Period
title_full A Geo-Tagged COVID-19 Twitter Dataset for 10 North American Metropolitan Areas over a 255-Day Period
title_fullStr A Geo-Tagged COVID-19 Twitter Dataset for 10 North American Metropolitan Areas over a 255-Day Period
title_full_unstemmed A Geo-Tagged COVID-19 Twitter Dataset for 10 North American Metropolitan Areas over a 255-Day Period
title_short A Geo-Tagged COVID-19 Twitter Dataset for 10 North American Metropolitan Areas over a 255-Day Period
title_sort geo tagged covid 19 twitter dataset for 10 north american metropolitan areas over a 255 day period
topic COVID-19
Twitter
geo-tagged
metropolitan
computational social science
url https://www.mdpi.com/2306-5729/6/6/64
work_keys_str_mv AT saramelotte ageotaggedcovid19twitterdatasetfor10northamericanmetropolitanareasovera255dayperiod
AT mayankkejriwal ageotaggedcovid19twitterdatasetfor10northamericanmetropolitanareasovera255dayperiod
AT saramelotte geotaggedcovid19twitterdatasetfor10northamericanmetropolitanareasovera255dayperiod
AT mayankkejriwal geotaggedcovid19twitterdatasetfor10northamericanmetropolitanareasovera255dayperiod