Tackling the infodemic during a pandemic: A comparative study on algorithms to deal with thematically heterogeneous fake news

Fake news poses a grave threat with devastating consequences in this information-centric age. While advances in data science undeniably hold the key to accurately detecting and curtailing the unfettered spread of fake news, guidance on the selection of algorithms and models that are best suited to a...

Full description

Bibliographic Details
Main Authors: Pramukh Nanjundaswamy Vasist, M.P. Sebastian
Format: Article
Language:English
Published: Elsevier 2022-11-01
Series:International Journal of Information Management Data Insights
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2667096822000763
_version_ 1797986473196126208
author Pramukh Nanjundaswamy Vasist
M.P. Sebastian
author_facet Pramukh Nanjundaswamy Vasist
M.P. Sebastian
author_sort Pramukh Nanjundaswamy Vasist
collection DOAJ
description Fake news poses a grave threat with devastating consequences in this information-centric age. While advances in data science undeniably hold the key to accurately detecting and curtailing the unfettered spread of fake news, guidance on the selection of algorithms and models that are best suited to a specific fake news scenario leaves much to be desired. Most studies have focused on fake news in a specific domain and employed a limited range of algorithmic techniques. In contrast, the thematic diversity of fake news raises questions over the comprehensiveness of such techniques, whose performance drops when exposed to fake news from a different domain. The current study responds to this call for guidance by focusing on thematically diverse datasets, applying a series of complex algorithms, and performing topic modeling on them. The results demonstrate that ensemble techniques outperform other algorithms, achieving high levels of accuracy of over 98 percent and 95 percent on thematically diverse and pandemic-related datasets, respectively. The study also demonstrates that neural networks are not a panacea for all situations, while topic modeling helps illustrate the lack of coherence in fake news articles. The study offers a distinct perspective on the accuracy of a diverse set of algorithmic approaches and their ability to adapt to an ever-evolving multi-domain world of fake news. A key implication of the study is the unique and comprehensive view of classification performance when exposed to diverse datasets, including pandemic-related news and data from other disciplines, as opposed to its performance on pandemic-related data alone. Our practical contribution is truly the comparative perspective we offer to practitioners when a choice of algorithm is to be made to accurately detect fake news with thematic heterogeneity.
first_indexed 2024-04-11T07:34:01Z
format Article
id doaj.art-771d60c6117c44b39481e9c4eb649a7a
institution Directory Open Access Journal
issn 2667-0968
language English
last_indexed 2024-04-11T07:34:01Z
publishDate 2022-11-01
publisher Elsevier
record_format Article
series International Journal of Information Management Data Insights
spelling doaj.art-771d60c6117c44b39481e9c4eb649a7a2022-12-22T04:36:47ZengElsevierInternational Journal of Information Management Data Insights2667-09682022-11-0122100133Tackling the infodemic during a pandemic: A comparative study on algorithms to deal with thematically heterogeneous fake newsPramukh Nanjundaswamy Vasist0M.P. Sebastian1Corresponding author.; Information Systems Area, Indian Institute of Management Kozhikode, IndiaInformation Systems Area, Indian Institute of Management Kozhikode, IndiaFake news poses a grave threat with devastating consequences in this information-centric age. While advances in data science undeniably hold the key to accurately detecting and curtailing the unfettered spread of fake news, guidance on the selection of algorithms and models that are best suited to a specific fake news scenario leaves much to be desired. Most studies have focused on fake news in a specific domain and employed a limited range of algorithmic techniques. In contrast, the thematic diversity of fake news raises questions over the comprehensiveness of such techniques, whose performance drops when exposed to fake news from a different domain. The current study responds to this call for guidance by focusing on thematically diverse datasets, applying a series of complex algorithms, and performing topic modeling on them. The results demonstrate that ensemble techniques outperform other algorithms, achieving high levels of accuracy of over 98 percent and 95 percent on thematically diverse and pandemic-related datasets, respectively. The study also demonstrates that neural networks are not a panacea for all situations, while topic modeling helps illustrate the lack of coherence in fake news articles. The study offers a distinct perspective on the accuracy of a diverse set of algorithmic approaches and their ability to adapt to an ever-evolving multi-domain world of fake news. A key implication of the study is the unique and comprehensive view of classification performance when exposed to diverse datasets, including pandemic-related news and data from other disciplines, as opposed to its performance on pandemic-related data alone. Our practical contribution is truly the comparative perspective we offer to practitioners when a choice of algorithm is to be made to accurately detect fake news with thematic heterogeneity.http://www.sciencedirect.com/science/article/pii/S2667096822000763EnsembleFake newsMachine learningNeural networksText classification
spellingShingle Pramukh Nanjundaswamy Vasist
M.P. Sebastian
Tackling the infodemic during a pandemic: A comparative study on algorithms to deal with thematically heterogeneous fake news
International Journal of Information Management Data Insights
Ensemble
Fake news
Machine learning
Neural networks
Text classification
title Tackling the infodemic during a pandemic: A comparative study on algorithms to deal with thematically heterogeneous fake news
title_full Tackling the infodemic during a pandemic: A comparative study on algorithms to deal with thematically heterogeneous fake news
title_fullStr Tackling the infodemic during a pandemic: A comparative study on algorithms to deal with thematically heterogeneous fake news
title_full_unstemmed Tackling the infodemic during a pandemic: A comparative study on algorithms to deal with thematically heterogeneous fake news
title_short Tackling the infodemic during a pandemic: A comparative study on algorithms to deal with thematically heterogeneous fake news
title_sort tackling the infodemic during a pandemic a comparative study on algorithms to deal with thematically heterogeneous fake news
topic Ensemble
Fake news
Machine learning
Neural networks
Text classification
url http://www.sciencedirect.com/science/article/pii/S2667096822000763
work_keys_str_mv AT pramukhnanjundaswamyvasist tacklingtheinfodemicduringapandemicacomparativestudyonalgorithmstodealwiththematicallyheterogeneousfakenews
AT mpsebastian tacklingtheinfodemicduringapandemicacomparativestudyonalgorithmstodealwiththematicallyheterogeneousfakenews