Ensemble machine learning approaches for fake news classification

In today’s interconnected digital landscape, the proliferation of fake news has become a significant challenge, with far-reaching implications for individuals, institutions, and societies. The rapid spread of misleading information undermines the credibility of genuine news outlets and threatens inf...

Full description

Bibliographic Details
Main Authors: Halyna Padalko, Vasyl Chomko, Sergiy Yakovlev, Dmytro Chumachenko
Format: Article
Language:English
Published: National Aerospace University «Kharkiv Aviation Institute» 2023-12-01
Series:Радіоелектронні і комп'ютерні системи
Subjects:
Online Access:http://nti.khai.edu/ojs/index.php/reks/article/view/2181
_version_ 1797367114961518592
author Halyna Padalko
Vasyl Chomko
Sergiy Yakovlev
Dmytro Chumachenko
author_facet Halyna Padalko
Vasyl Chomko
Sergiy Yakovlev
Dmytro Chumachenko
author_sort Halyna Padalko
collection DOAJ
description In today’s interconnected digital landscape, the proliferation of fake news has become a significant challenge, with far-reaching implications for individuals, institutions, and societies. The rapid spread of misleading information undermines the credibility of genuine news outlets and threatens informed decision-making, public trust, and democratic processes. Recognizing the profound relevance and urgency of addressing this issue, this research embarked on a mission to harness the power of machine learning to combat fake news menace. This study develops an ensemble machine learning model for fake news classification. The research is targeted at spreading fake news. The research subjects are machine learning methods for misinformation classification. Methods: we employed three state-of-the-art algorithms: LightGBM, XGBoost, and Balanced Random Forest (BRF). Each model was meticulously trained on a comprehensive dataset curated to encompass a diverse range of news articles, ensuring a broad representation of linguistic patterns and styles. A distinctive feature of the proposed approach is the emphasis on token importance. By leveraging specific tokens that exhibited a high degree of influence on classification outcomes, we enhanced the precision and reliability of the developed models. The empirical results were both promising and illuminating. The LightGBM model emerged as the top performer among the three, registering an impressive F1-score of 97.74% and an accuracy rate of 97.64%. Notably, all three of the proposed models consistently outperformed several existing models previously documented in academic literature. This comparative analysis underscores the efficacy and superiority of the proposed ensemble approach. In conclusion, this study contributes a robust, innovative, and scalable solution to the pressing challenge of fake news detection. By harnessing the capabilities of advanced machine learning techniques, the research findings pave the way for enhancing the integrity and veracity of information in an increasingly digitalized world, thereby safeguarding public trust and promoting informed discourse.
first_indexed 2024-03-08T17:13:44Z
format Article
id doaj.art-4ff23cde54d2482ab3d6f8f753cad0f9
institution Directory Open Access Journal
issn 1814-4225
2663-2012
language English
last_indexed 2024-03-08T17:13:44Z
publishDate 2023-12-01
publisher National Aerospace University «Kharkiv Aviation Institute»
record_format Article
series Радіоелектронні і комп'ютерні системи
spelling doaj.art-4ff23cde54d2482ab3d6f8f753cad0f92024-01-03T18:00:58ZengNational Aerospace University «Kharkiv Aviation Institute»Радіоелектронні і комп'ютерні системи1814-42252663-20122023-12-010451910.32620/reks.2023.4.012022Ensemble machine learning approaches for fake news classificationHalyna Padalko0Vasyl Chomko1Sergiy Yakovlev2Dmytro Chumachenko3National Aerospace University "Kharkiv Aviation Institute", Kharkiv, Ukraine; University of Waterloo, Waterloo, Canada; Balsillie School of International Affairs, Waterloo, CanadaUniversity of Waterloo, WaterlooInstitute of Information Technology, Lodz University of Technology, LodzNational Aerospace University "Kharkiv Aviation Institute", KharkivIn today’s interconnected digital landscape, the proliferation of fake news has become a significant challenge, with far-reaching implications for individuals, institutions, and societies. The rapid spread of misleading information undermines the credibility of genuine news outlets and threatens informed decision-making, public trust, and democratic processes. Recognizing the profound relevance and urgency of addressing this issue, this research embarked on a mission to harness the power of machine learning to combat fake news menace. This study develops an ensemble machine learning model for fake news classification. The research is targeted at spreading fake news. The research subjects are machine learning methods for misinformation classification. Methods: we employed three state-of-the-art algorithms: LightGBM, XGBoost, and Balanced Random Forest (BRF). Each model was meticulously trained on a comprehensive dataset curated to encompass a diverse range of news articles, ensuring a broad representation of linguistic patterns and styles. A distinctive feature of the proposed approach is the emphasis on token importance. By leveraging specific tokens that exhibited a high degree of influence on classification outcomes, we enhanced the precision and reliability of the developed models. The empirical results were both promising and illuminating. The LightGBM model emerged as the top performer among the three, registering an impressive F1-score of 97.74% and an accuracy rate of 97.64%. Notably, all three of the proposed models consistently outperformed several existing models previously documented in academic literature. This comparative analysis underscores the efficacy and superiority of the proposed ensemble approach. In conclusion, this study contributes a robust, innovative, and scalable solution to the pressing challenge of fake news detection. By harnessing the capabilities of advanced machine learning techniques, the research findings pave the way for enhancing the integrity and veracity of information in an increasingly digitalized world, thereby safeguarding public trust and promoting informed discourse.http://nti.khai.edu/ojs/index.php/reks/article/view/2181fake newsclassificationmisinformationdisinformationbalanced random forestxgboostlightgbmwelfakemachine learning
spellingShingle Halyna Padalko
Vasyl Chomko
Sergiy Yakovlev
Dmytro Chumachenko
Ensemble machine learning approaches for fake news classification
Радіоелектронні і комп'ютерні системи
fake news
classification
misinformation
disinformation
balanced random forest
xgboost
lightgbm
welfake
machine learning
title Ensemble machine learning approaches for fake news classification
title_full Ensemble machine learning approaches for fake news classification
title_fullStr Ensemble machine learning approaches for fake news classification
title_full_unstemmed Ensemble machine learning approaches for fake news classification
title_short Ensemble machine learning approaches for fake news classification
title_sort ensemble machine learning approaches for fake news classification
topic fake news
classification
misinformation
disinformation
balanced random forest
xgboost
lightgbm
welfake
machine learning
url http://nti.khai.edu/ojs/index.php/reks/article/view/2181
work_keys_str_mv AT halynapadalko ensemblemachinelearningapproachesforfakenewsclassification
AT vasylchomko ensemblemachinelearningapproachesforfakenewsclassification
AT sergiyyakovlev ensemblemachinelearningapproachesforfakenewsclassification
AT dmytrochumachenko ensemblemachinelearningapproachesforfakenewsclassification