Nowcasting unemployment rate during the COVID-19 pandemic using Twitter data: The case of South Africa

The global economy has been hard hit by the COVID-19 pandemic. Many countries are experiencing a severe and destructive recession. A significant number of firms and businesses have gone bankrupt or been scaled down, and many individuals have lost their jobs. The main goal of this study is to support...

Full description

Bibliographic Details
Main Authors: Zahra Movahedi Nia, Ali Asgary, Nicola Bragazzi, Bruce Mellado, James Orbinski, Jianhong Wu, Jude Kong
Format: Article
Language:English
Published: Frontiers Media S.A. 2022-12-01
Series:Frontiers in Public Health
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fpubh.2022.952363/full
_version_ 1798014296402165760
author Zahra Movahedi Nia
Ali Asgary
Nicola Bragazzi
Bruce Mellado
James Orbinski
Jianhong Wu
Jude Kong
author_facet Zahra Movahedi Nia
Ali Asgary
Nicola Bragazzi
Bruce Mellado
James Orbinski
Jianhong Wu
Jude Kong
author_sort Zahra Movahedi Nia
collection DOAJ
description The global economy has been hard hit by the COVID-19 pandemic. Many countries are experiencing a severe and destructive recession. A significant number of firms and businesses have gone bankrupt or been scaled down, and many individuals have lost their jobs. The main goal of this study is to support policy- and decision-makers with additional and real-time information about the labor market flow using Twitter data. We leverage the data to trace and nowcast the unemployment rate of South Africa during the COVID-19 pandemic. First, we create a dataset of unemployment-related tweets using certain keywords. Principal Component Regression (PCR) is then applied to nowcast the unemployment rate using the gathered tweets and their sentiment scores. Numerical results indicate that the volume of the tweets has a positive correlation, and the sentiments of the tweets have a negative correlation with the unemployment rate during and before the COVID-19 pandemic. Moreover, the now-casted unemployment rate using PCR has an outstanding evaluation result with a low Root Mean Square Error (RMSE), Mean Absolute Percentage Error (MAPE), Symmetric MAPE (SMAPE) of 0.921, 0.018, 0.018, respectively and a high R2-score of 0.929.
first_indexed 2024-04-11T15:16:15Z
format Article
id doaj.art-bd8f44a91bd04e0f8d96c33c8c86d9ed
institution Directory Open Access Journal
issn 2296-2565
language English
last_indexed 2024-04-11T15:16:15Z
publishDate 2022-12-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Public Health
spelling doaj.art-bd8f44a91bd04e0f8d96c33c8c86d9ed2022-12-22T04:16:29ZengFrontiers Media S.A.Frontiers in Public Health2296-25652022-12-011010.3389/fpubh.2022.952363952363Nowcasting unemployment rate during the COVID-19 pandemic using Twitter data: The case of South AfricaZahra Movahedi Nia0Ali Asgary1Nicola Bragazzi2Bruce Mellado3James Orbinski4Jianhong Wu5Jude Kong6Africa-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), Laboratory for Industrial and Applied Mathematics, York University, Toronto, ON, CanadaAfrica-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), The Advanced Disaster, Emergency and Rapid Response Program, York University, Toronto, ON, CanadaAfrica-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), Laboratory for Industrial and Applied Mathematics, York University, Toronto, ON, CanadaAfrica-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), Schools of Physics, Institute for Collider Particle Physics, University of the Witwatersrand, Johannesburg, South AfricaAfrica-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), The Dahdaleh Institute for Global Health Research, York University, Toronto, ON, CanadaAfrica-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), Laboratory for Industrial and Applied Mathematics, York University, Toronto, ON, CanadaAfrica-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), Laboratory for Industrial and Applied Mathematics, York University, Toronto, ON, CanadaThe global economy has been hard hit by the COVID-19 pandemic. Many countries are experiencing a severe and destructive recession. A significant number of firms and businesses have gone bankrupt or been scaled down, and many individuals have lost their jobs. The main goal of this study is to support policy- and decision-makers with additional and real-time information about the labor market flow using Twitter data. We leverage the data to trace and nowcast the unemployment rate of South Africa during the COVID-19 pandemic. First, we create a dataset of unemployment-related tweets using certain keywords. Principal Component Regression (PCR) is then applied to nowcast the unemployment rate using the gathered tweets and their sentiment scores. Numerical results indicate that the volume of the tweets has a positive correlation, and the sentiments of the tweets have a negative correlation with the unemployment rate during and before the COVID-19 pandemic. Moreover, the now-casted unemployment rate using PCR has an outstanding evaluation result with a low Root Mean Square Error (RMSE), Mean Absolute Percentage Error (MAPE), Symmetric MAPE (SMAPE) of 0.921, 0.018, 0.018, respectively and a high R2-score of 0.929.https://www.frontiersin.org/articles/10.3389/fpubh.2022.952363/fullsentiment analysissocial mediaTwitter dataGoogle Mobility Indexunemployment ratelabor market
spellingShingle Zahra Movahedi Nia
Ali Asgary
Nicola Bragazzi
Bruce Mellado
James Orbinski
Jianhong Wu
Jude Kong
Nowcasting unemployment rate during the COVID-19 pandemic using Twitter data: The case of South Africa
Frontiers in Public Health
sentiment analysis
social media
Twitter data
Google Mobility Index
unemployment rate
labor market
title Nowcasting unemployment rate during the COVID-19 pandemic using Twitter data: The case of South Africa
title_full Nowcasting unemployment rate during the COVID-19 pandemic using Twitter data: The case of South Africa
title_fullStr Nowcasting unemployment rate during the COVID-19 pandemic using Twitter data: The case of South Africa
title_full_unstemmed Nowcasting unemployment rate during the COVID-19 pandemic using Twitter data: The case of South Africa
title_short Nowcasting unemployment rate during the COVID-19 pandemic using Twitter data: The case of South Africa
title_sort nowcasting unemployment rate during the covid 19 pandemic using twitter data the case of south africa
topic sentiment analysis
social media
Twitter data
Google Mobility Index
unemployment rate
labor market
url https://www.frontiersin.org/articles/10.3389/fpubh.2022.952363/full
work_keys_str_mv AT zahramovahedinia nowcastingunemploymentrateduringthecovid19pandemicusingtwitterdatathecaseofsouthafrica
AT aliasgary nowcastingunemploymentrateduringthecovid19pandemicusingtwitterdatathecaseofsouthafrica
AT nicolabragazzi nowcastingunemploymentrateduringthecovid19pandemicusingtwitterdatathecaseofsouthafrica
AT brucemellado nowcastingunemploymentrateduringthecovid19pandemicusingtwitterdatathecaseofsouthafrica
AT jamesorbinski nowcastingunemploymentrateduringthecovid19pandemicusingtwitterdatathecaseofsouthafrica
AT jianhongwu nowcastingunemploymentrateduringthecovid19pandemicusingtwitterdatathecaseofsouthafrica
AT judekong nowcastingunemploymentrateduringthecovid19pandemicusingtwitterdatathecaseofsouthafrica