A Hybrid Linguistic and Knowledge-Based Analysis Approach for Fake News Detection on Social Media

The rapid development of different social media and content-sharing platforms has been largely exploited to spread misinformation and fake news that make people believing in harmful stories, which allow to influence public opinion, and could cause panic and chaos among population. Thus, fake news de...

Full description

Bibliographic Details
Main Authors: Noureddine Seddari, Abdelouahid Derhab, Mohamed Belaoued, Waleed Halboob, Jalal Al-Muhtadi, Abdelghani Bouras
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9791232/
Description
Summary:The rapid development of different social media and content-sharing platforms has been largely exploited to spread misinformation and fake news that make people believing in harmful stories, which allow to influence public opinion, and could cause panic and chaos among population. Thus, fake news detection has become an important research topic, aiming at flagging a specific content as fake or legitimate. The fake news detection solutions can be divided into three main categories: content-based, social context-based, and knowledge-based approaches. In this paper, we propose a novel hybrid fake news detection system that combines linguistic and knowledge-based approaches and inherits their advantages, by employing two different sets of features: (1) linguistic features (i.e., title, number of words, reading ease, lexical diversity,and sentiment), and (2) a novel set of knowledge-based features, called <italic>fact-verification</italic> features that comprise three types of information namely, (i) <italic>reputation of the website</italic> where the news is published, (ii) <italic>coverage</italic>, i.e., number of sources that published the news, and (iii) <italic>fact-check</italic>, i.e., opinion of well-known fact-checking websites about the news, i.e., true or false. The proposed system only employs eight features, which is less than most of the state-of-the-art approaches. Also, the evaluation results on a fake news dataset show that the proposed system employing both types of features can reach an accuracy of 94.4&#x0025;, which is better compared to that obtained from separately employing linguistic features (i.e., accuracy=89.4&#x0025;) and fact-verification features (i.e., accuracy=81.2&#x0025;).
ISSN:2169-3536