Ensemble machine learning approaches for fake news classification

In today’s interconnected digital landscape, the proliferation of fake news has become a significant challenge, with far-reaching implications for individuals, institutions, and societies. The rapid spread of misleading information undermines the credibility of genuine news outlets and threatens inf...

Full description

Bibliographic Details
Main Authors:	Halyna Padalko, Vasyl Chomko, Sergiy Yakovlev, Dmytro Chumachenko
Format:	Article
Language:	English
Published:	National Aerospace University «Kharkiv Aviation Institute» 2023-12-01
Series:	Радіоелектронні і комп'ютерні системи
Subjects:	fake news classification misinformation disinformation balanced random forest xgboost lightgbm welfake machine learning
Online Access:	http://nti.khai.edu/ojs/index.php/reks/article/view/2181

_version_	1797367114961518592
author	Halyna Padalko Vasyl Chomko Sergiy Yakovlev Dmytro Chumachenko
author_facet	Halyna Padalko Vasyl Chomko Sergiy Yakovlev Dmytro Chumachenko
author_sort	Halyna Padalko
collection	DOAJ
description	In today’s interconnected digital landscape, the proliferation of fake news has become a significant challenge, with far-reaching implications for individuals, institutions, and societies. The rapid spread of misleading information undermines the credibility of genuine news outlets and threatens informed decision-making, public trust, and democratic processes. Recognizing the profound relevance and urgency of addressing this issue, this research embarked on a mission to harness the power of machine learning to combat fake news menace. This study develops an ensemble machine learning model for fake news classification. The research is targeted at spreading fake news. The research subjects are machine learning methods for misinformation classification. Methods: we employed three state-of-the-art algorithms: LightGBM, XGBoost, and Balanced Random Forest (BRF). Each model was meticulously trained on a comprehensive dataset curated to encompass a diverse range of news articles, ensuring a broad representation of linguistic patterns and styles. A distinctive feature of the proposed approach is the emphasis on token importance. By leveraging specific tokens that exhibited a high degree of influence on classification outcomes, we enhanced the precision and reliability of the developed models. The empirical results were both promising and illuminating. The LightGBM model emerged as the top performer among the three, registering an impressive F1-score of 97.74% and an accuracy rate of 97.64%. Notably, all three of the proposed models consistently outperformed several existing models previously documented in academic literature. This comparative analysis underscores the efficacy and superiority of the proposed ensemble approach. In conclusion, this study contributes a robust, innovative, and scalable solution to the pressing challenge of fake news detection. By harnessing the capabilities of advanced machine learning techniques, the research findings pave the way for enhancing the integrity and veracity of information in an increasingly digitalized world, thereby safeguarding public trust and promoting informed discourse.
first_indexed	2024-03-08T17:13:44Z
format	Article
id	doaj.art-4ff23cde54d2482ab3d6f8f753cad0f9
institution	Directory Open Access Journal
issn	1814-4225 2663-2012
language	English
last_indexed	2024-03-08T17:13:44Z
publishDate	2023-12-01
publisher	National Aerospace University «Kharkiv Aviation Institute»
record_format	Article
series	Радіоелектронні і комп'ютерні системи
spelling	doaj.art-4ff23cde54d2482ab3d6f8f753cad0f92024-01-03T18:00:58ZengNational Aerospace University «Kharkiv Aviation Institute»Радіоелектронні і комп'ютерні системи1814-42252663-20122023-12-010451910.32620/reks.2023.4.012022Ensemble machine learning approaches for fake news classificationHalyna Padalko0Vasyl Chomko1Sergiy Yakovlev2Dmytro Chumachenko3National Aerospace University "Kharkiv Aviation Institute", Kharkiv, Ukraine; University of Waterloo, Waterloo, Canada; Balsillie School of International Affairs, Waterloo, CanadaUniversity of Waterloo, WaterlooInstitute of Information Technology, Lodz University of Technology, LodzNational Aerospace University "Kharkiv Aviation Institute", KharkivIn today’s interconnected digital landscape, the proliferation of fake news has become a significant challenge, with far-reaching implications for individuals, institutions, and societies. The rapid spread of misleading information undermines the credibility of genuine news outlets and threatens informed decision-making, public trust, and democratic processes. Recognizing the profound relevance and urgency of addressing this issue, this research embarked on a mission to harness the power of machine learning to combat fake news menace. This study develops an ensemble machine learning model for fake news classification. The research is targeted at spreading fake news. The research subjects are machine learning methods for misinformation classification. Methods: we employed three state-of-the-art algorithms: LightGBM, XGBoost, and Balanced Random Forest (BRF). Each model was meticulously trained on a comprehensive dataset curated to encompass a diverse range of news articles, ensuring a broad representation of linguistic patterns and styles. A distinctive feature of the proposed approach is the emphasis on token importance. By leveraging specific tokens that exhibited a high degree of influence on classification outcomes, we enhanced the precision and reliability of the developed models. The empirical results were both promising and illuminating. The LightGBM model emerged as the top performer among the three, registering an impressive F1-score of 97.74% and an accuracy rate of 97.64%. Notably, all three of the proposed models consistently outperformed several existing models previously documented in academic literature. This comparative analysis underscores the efficacy and superiority of the proposed ensemble approach. In conclusion, this study contributes a robust, innovative, and scalable solution to the pressing challenge of fake news detection. By harnessing the capabilities of advanced machine learning techniques, the research findings pave the way for enhancing the integrity and veracity of information in an increasingly digitalized world, thereby safeguarding public trust and promoting informed discourse.http://nti.khai.edu/ojs/index.php/reks/article/view/2181fake newsclassificationmisinformationdisinformationbalanced random forestxgboostlightgbmwelfakemachine learning
spellingShingle	Halyna Padalko Vasyl Chomko Sergiy Yakovlev Dmytro Chumachenko Ensemble machine learning approaches for fake news classification Радіоелектронні і комп'ютерні системи fake news classification misinformation disinformation balanced random forest xgboost lightgbm welfake machine learning
title	Ensemble machine learning approaches for fake news classification
title_full	Ensemble machine learning approaches for fake news classification
title_fullStr	Ensemble machine learning approaches for fake news classification
title_full_unstemmed	Ensemble machine learning approaches for fake news classification
title_short	Ensemble machine learning approaches for fake news classification
title_sort	ensemble machine learning approaches for fake news classification
topic	fake news classification misinformation disinformation balanced random forest xgboost lightgbm welfake machine learning
url	http://nti.khai.edu/ojs/index.php/reks/article/view/2181
work_keys_str_mv	AT halynapadalko ensemblemachinelearningapproachesforfakenewsclassification AT vasylchomko ensemblemachinelearningapproachesforfakenewsclassification AT sergiyyakovlev ensemblemachinelearningapproachesforfakenewsclassification AT dmytrochumachenko ensemblemachinelearningapproachesforfakenewsclassification

Ensemble machine learning approaches for fake news classification

Similar Items