Tackling the infodemic during a pandemic: A comparative study on algorithms to deal with thematically heterogeneous fake news
Fake news poses a grave threat with devastating consequences in this information-centric age. While advances in data science undeniably hold the key to accurately detecting and curtailing the unfettered spread of fake news, guidance on the selection of algorithms and models that are best suited to a...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2022-11-01
|
Series: | International Journal of Information Management Data Insights |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2667096822000763 |
_version_ | 1797986473196126208 |
---|---|
author | Pramukh Nanjundaswamy Vasist M.P. Sebastian |
author_facet | Pramukh Nanjundaswamy Vasist M.P. Sebastian |
author_sort | Pramukh Nanjundaswamy Vasist |
collection | DOAJ |
description | Fake news poses a grave threat with devastating consequences in this information-centric age. While advances in data science undeniably hold the key to accurately detecting and curtailing the unfettered spread of fake news, guidance on the selection of algorithms and models that are best suited to a specific fake news scenario leaves much to be desired. Most studies have focused on fake news in a specific domain and employed a limited range of algorithmic techniques. In contrast, the thematic diversity of fake news raises questions over the comprehensiveness of such techniques, whose performance drops when exposed to fake news from a different domain. The current study responds to this call for guidance by focusing on thematically diverse datasets, applying a series of complex algorithms, and performing topic modeling on them. The results demonstrate that ensemble techniques outperform other algorithms, achieving high levels of accuracy of over 98 percent and 95 percent on thematically diverse and pandemic-related datasets, respectively. The study also demonstrates that neural networks are not a panacea for all situations, while topic modeling helps illustrate the lack of coherence in fake news articles. The study offers a distinct perspective on the accuracy of a diverse set of algorithmic approaches and their ability to adapt to an ever-evolving multi-domain world of fake news. A key implication of the study is the unique and comprehensive view of classification performance when exposed to diverse datasets, including pandemic-related news and data from other disciplines, as opposed to its performance on pandemic-related data alone. Our practical contribution is truly the comparative perspective we offer to practitioners when a choice of algorithm is to be made to accurately detect fake news with thematic heterogeneity. |
first_indexed | 2024-04-11T07:34:01Z |
format | Article |
id | doaj.art-771d60c6117c44b39481e9c4eb649a7a |
institution | Directory Open Access Journal |
issn | 2667-0968 |
language | English |
last_indexed | 2024-04-11T07:34:01Z |
publishDate | 2022-11-01 |
publisher | Elsevier |
record_format | Article |
series | International Journal of Information Management Data Insights |
spelling | doaj.art-771d60c6117c44b39481e9c4eb649a7a2022-12-22T04:36:47ZengElsevierInternational Journal of Information Management Data Insights2667-09682022-11-0122100133Tackling the infodemic during a pandemic: A comparative study on algorithms to deal with thematically heterogeneous fake newsPramukh Nanjundaswamy Vasist0M.P. Sebastian1Corresponding author.; Information Systems Area, Indian Institute of Management Kozhikode, IndiaInformation Systems Area, Indian Institute of Management Kozhikode, IndiaFake news poses a grave threat with devastating consequences in this information-centric age. While advances in data science undeniably hold the key to accurately detecting and curtailing the unfettered spread of fake news, guidance on the selection of algorithms and models that are best suited to a specific fake news scenario leaves much to be desired. Most studies have focused on fake news in a specific domain and employed a limited range of algorithmic techniques. In contrast, the thematic diversity of fake news raises questions over the comprehensiveness of such techniques, whose performance drops when exposed to fake news from a different domain. The current study responds to this call for guidance by focusing on thematically diverse datasets, applying a series of complex algorithms, and performing topic modeling on them. The results demonstrate that ensemble techniques outperform other algorithms, achieving high levels of accuracy of over 98 percent and 95 percent on thematically diverse and pandemic-related datasets, respectively. The study also demonstrates that neural networks are not a panacea for all situations, while topic modeling helps illustrate the lack of coherence in fake news articles. The study offers a distinct perspective on the accuracy of a diverse set of algorithmic approaches and their ability to adapt to an ever-evolving multi-domain world of fake news. A key implication of the study is the unique and comprehensive view of classification performance when exposed to diverse datasets, including pandemic-related news and data from other disciplines, as opposed to its performance on pandemic-related data alone. Our practical contribution is truly the comparative perspective we offer to practitioners when a choice of algorithm is to be made to accurately detect fake news with thematic heterogeneity.http://www.sciencedirect.com/science/article/pii/S2667096822000763EnsembleFake newsMachine learningNeural networksText classification |
spellingShingle | Pramukh Nanjundaswamy Vasist M.P. Sebastian Tackling the infodemic during a pandemic: A comparative study on algorithms to deal with thematically heterogeneous fake news International Journal of Information Management Data Insights Ensemble Fake news Machine learning Neural networks Text classification |
title | Tackling the infodemic during a pandemic: A comparative study on algorithms to deal with thematically heterogeneous fake news |
title_full | Tackling the infodemic during a pandemic: A comparative study on algorithms to deal with thematically heterogeneous fake news |
title_fullStr | Tackling the infodemic during a pandemic: A comparative study on algorithms to deal with thematically heterogeneous fake news |
title_full_unstemmed | Tackling the infodemic during a pandemic: A comparative study on algorithms to deal with thematically heterogeneous fake news |
title_short | Tackling the infodemic during a pandemic: A comparative study on algorithms to deal with thematically heterogeneous fake news |
title_sort | tackling the infodemic during a pandemic a comparative study on algorithms to deal with thematically heterogeneous fake news |
topic | Ensemble Fake news Machine learning Neural networks Text classification |
url | http://www.sciencedirect.com/science/article/pii/S2667096822000763 |
work_keys_str_mv | AT pramukhnanjundaswamyvasist tacklingtheinfodemicduringapandemicacomparativestudyonalgorithmstodealwiththematicallyheterogeneousfakenews AT mpsebastian tacklingtheinfodemicduringapandemicacomparativestudyonalgorithmstodealwiththematicallyheterogeneousfakenews |