Comparative Analysis of TF–IDF and Hashing Vectorizer for Fake News Detection in Sindhi: A Machine Learning and Deep Learning Approach

Social media has become a popular platform for accessing and sharing news, but it has also led to a rise in fake news, posing serious risks. The ease of dissemination and constant flow of information raise concerns about the spread of incorrect information. Timely verification of news is crucial to...

Full description

Bibliographic Details
Main Authors: Rubab Roshan, Irfan Ali Bhacho, Sammer Zai
Format: Article
Language:English
Published: MDPI AG 2023-09-01
Series:Engineering Proceedings
Subjects:
Online Access:https://www.mdpi.com/2673-4591/46/1/5
_version_ 1797381168635576320
author Rubab Roshan
Irfan Ali Bhacho
Sammer Zai
author_facet Rubab Roshan
Irfan Ali Bhacho
Sammer Zai
author_sort Rubab Roshan
collection DOAJ
description Social media has become a popular platform for accessing and sharing news, but it has also led to a rise in fake news, posing serious risks. The ease of dissemination and constant flow of information raise concerns about the spread of incorrect information. Timely verification of news is crucial to combat false news. However, most research on false news identification has focused on English, neglecting South Asian languages. This study examines a dataset of Sindhi tweets, employing text feature extraction techniques such as TF–IDF and hashing vectorizer. Several machine learning algorithms, along with advanced deep learning models such as Transformer BERT, were utilized for analysis.
first_indexed 2024-03-08T20:47:29Z
format Article
id doaj.art-1b8f355d98fd43dbb6b6ff687320f253
institution Directory Open Access Journal
issn 2673-4591
language English
last_indexed 2024-03-08T20:47:29Z
publishDate 2023-09-01
publisher MDPI AG
record_format Article
series Engineering Proceedings
spelling doaj.art-1b8f355d98fd43dbb6b6ff687320f2532023-12-22T14:07:01ZengMDPI AGEngineering Proceedings2673-45912023-09-01461510.3390/engproc2023046005Comparative Analysis of TF–IDF and Hashing Vectorizer for Fake News Detection in Sindhi: A Machine Learning and Deep Learning ApproachRubab Roshan0Irfan Ali Bhacho1Sammer Zai2Department of Computer Systems Engineering, Mehran University of Engineering and Technology (MUET), Jamshoro 76062, PakistanDepartment of Computer Systems Engineering, Mehran University of Engineering and Technology (MUET), Jamshoro 76062, PakistanDepartment of Computer Systems Engineering, Mehran University of Engineering and Technology (MUET), Jamshoro 76062, PakistanSocial media has become a popular platform for accessing and sharing news, but it has also led to a rise in fake news, posing serious risks. The ease of dissemination and constant flow of information raise concerns about the spread of incorrect information. Timely verification of news is crucial to combat false news. However, most research on false news identification has focused on English, neglecting South Asian languages. This study examines a dataset of Sindhi tweets, employing text feature extraction techniques such as TF–IDF and hashing vectorizer. Several machine learning algorithms, along with advanced deep learning models such as Transformer BERT, were utilized for analysis.https://www.mdpi.com/2673-4591/46/1/5machine learningdeep learningBERTtext miningTF–IDFhashing vectorizer
spellingShingle Rubab Roshan
Irfan Ali Bhacho
Sammer Zai
Comparative Analysis of TF–IDF and Hashing Vectorizer for Fake News Detection in Sindhi: A Machine Learning and Deep Learning Approach
Engineering Proceedings
machine learning
deep learning
BERT
text mining
TF–IDF
hashing vectorizer
title Comparative Analysis of TF–IDF and Hashing Vectorizer for Fake News Detection in Sindhi: A Machine Learning and Deep Learning Approach
title_full Comparative Analysis of TF–IDF and Hashing Vectorizer for Fake News Detection in Sindhi: A Machine Learning and Deep Learning Approach
title_fullStr Comparative Analysis of TF–IDF and Hashing Vectorizer for Fake News Detection in Sindhi: A Machine Learning and Deep Learning Approach
title_full_unstemmed Comparative Analysis of TF–IDF and Hashing Vectorizer for Fake News Detection in Sindhi: A Machine Learning and Deep Learning Approach
title_short Comparative Analysis of TF–IDF and Hashing Vectorizer for Fake News Detection in Sindhi: A Machine Learning and Deep Learning Approach
title_sort comparative analysis of tf idf and hashing vectorizer for fake news detection in sindhi a machine learning and deep learning approach
topic machine learning
deep learning
BERT
text mining
TF–IDF
hashing vectorizer
url https://www.mdpi.com/2673-4591/46/1/5
work_keys_str_mv AT rubabroshan comparativeanalysisoftfidfandhashingvectorizerforfakenewsdetectioninsindhiamachinelearninganddeeplearningapproach
AT irfanalibhacho comparativeanalysisoftfidfandhashingvectorizerforfakenewsdetectioninsindhiamachinelearninganddeeplearningapproach
AT sammerzai comparativeanalysisoftfidfandhashingvectorizerforfakenewsdetectioninsindhiamachinelearninganddeeplearningapproach