Exploiting Stacked Autoencoders for Improved Sentiment Analysis

Sentiment analysis is an ongoing research field within the discipline of data mining. The majority of academics employ deep learning models for sentiment analysis due to their ability to self-learn and process vast amounts of data. However, the performance of deep learning models depends on the valu...

Full description

Bibliographic Details
Main Authors: Kanwal Ahmed, Muhammad Imran Nadeem, Dun Li, Zhiyun Zheng, Yazeed Yasin Ghadi, Muhammad Assam, Heba G. Mohamed
Format: Article
Language:English
Published: MDPI AG 2022-12-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/12/23/12380
_version_ 1827643381839298560
author Kanwal Ahmed
Muhammad Imran Nadeem
Dun Li
Zhiyun Zheng
Yazeed Yasin Ghadi
Muhammad Assam
Heba G. Mohamed
author_facet Kanwal Ahmed
Muhammad Imran Nadeem
Dun Li
Zhiyun Zheng
Yazeed Yasin Ghadi
Muhammad Assam
Heba G. Mohamed
author_sort Kanwal Ahmed
collection DOAJ
description Sentiment analysis is an ongoing research field within the discipline of data mining. The majority of academics employ deep learning models for sentiment analysis due to their ability to self-learn and process vast amounts of data. However, the performance of deep learning models depends on the values of the hyperparameters. Determining suitable values for hyperparameters is a cumbersome task. The goal of this study is to increase the accuracy of stacked autoencoders for sentiment analysis using a heuristic optimization approach. In this study, we propose a hybrid model GA(SAE)-SVM using a genetic algorithm (GA), stacked autoencoder (SAE), and support vector machine (SVM) for fine-grained sentiment analysis. Features are extracted using continuous bag-of-words (CBOW), and then input into the SAE. In the proposed GA(SAE)-SVM, the hyperparameters of the SAE algorithm are optimized using GA. The features extracted by SAE are input into the SVM for final classification. A comparison is performed with a random search and grid search for parameter optimization. GA optimization is faster than grid search, and selects more optimal values than random search, resulting in improved accuracy. We evaluate the performance of the proposed model on eight benchmark datasets. The proposed model outperformed when compared to the baseline and state-of-the-art techniques.
first_indexed 2024-03-09T17:52:37Z
format Article
id doaj.art-b55a5a9e496942adb8e74ba3d75079e8
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-09T17:52:37Z
publishDate 2022-12-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-b55a5a9e496942adb8e74ba3d75079e82023-11-24T10:35:32ZengMDPI AGApplied Sciences2076-34172022-12-0112231238010.3390/app122312380Exploiting Stacked Autoencoders for Improved Sentiment AnalysisKanwal Ahmed0Muhammad Imran Nadeem1Dun Li2Zhiyun Zheng3Yazeed Yasin Ghadi4Muhammad Assam5Heba G. Mohamed6School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, ChinaSchool of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, ChinaSchool of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, ChinaSchool of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, ChinaDepartment of Computer Science and Software Engineering, Al Ain University, Al Ain 15551, United Arab EmiratesDepartment of Software Engineering, University of Science and Technology Bannu, Bannu 28100, PakistanDepartment of Electrical Engineering, College of Engineering, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi ArabiaSentiment analysis is an ongoing research field within the discipline of data mining. The majority of academics employ deep learning models for sentiment analysis due to their ability to self-learn and process vast amounts of data. However, the performance of deep learning models depends on the values of the hyperparameters. Determining suitable values for hyperparameters is a cumbersome task. The goal of this study is to increase the accuracy of stacked autoencoders for sentiment analysis using a heuristic optimization approach. In this study, we propose a hybrid model GA(SAE)-SVM using a genetic algorithm (GA), stacked autoencoder (SAE), and support vector machine (SVM) for fine-grained sentiment analysis. Features are extracted using continuous bag-of-words (CBOW), and then input into the SAE. In the proposed GA(SAE)-SVM, the hyperparameters of the SAE algorithm are optimized using GA. The features extracted by SAE are input into the SVM for final classification. A comparison is performed with a random search and grid search for parameter optimization. GA optimization is faster than grid search, and selects more optimal values than random search, resulting in improved accuracy. We evaluate the performance of the proposed model on eight benchmark datasets. The proposed model outperformed when compared to the baseline and state-of-the-art techniques.https://www.mdpi.com/2076-3417/12/23/12380data miningnatural language processingtext miningtext analysisweb mining
spellingShingle Kanwal Ahmed
Muhammad Imran Nadeem
Dun Li
Zhiyun Zheng
Yazeed Yasin Ghadi
Muhammad Assam
Heba G. Mohamed
Exploiting Stacked Autoencoders for Improved Sentiment Analysis
Applied Sciences
data mining
natural language processing
text mining
text analysis
web mining
title Exploiting Stacked Autoencoders for Improved Sentiment Analysis
title_full Exploiting Stacked Autoencoders for Improved Sentiment Analysis
title_fullStr Exploiting Stacked Autoencoders for Improved Sentiment Analysis
title_full_unstemmed Exploiting Stacked Autoencoders for Improved Sentiment Analysis
title_short Exploiting Stacked Autoencoders for Improved Sentiment Analysis
title_sort exploiting stacked autoencoders for improved sentiment analysis
topic data mining
natural language processing
text mining
text analysis
web mining
url https://www.mdpi.com/2076-3417/12/23/12380
work_keys_str_mv AT kanwalahmed exploitingstackedautoencodersforimprovedsentimentanalysis
AT muhammadimrannadeem exploitingstackedautoencodersforimprovedsentimentanalysis
AT dunli exploitingstackedautoencodersforimprovedsentimentanalysis
AT zhiyunzheng exploitingstackedautoencodersforimprovedsentimentanalysis
AT yazeedyasinghadi exploitingstackedautoencodersforimprovedsentimentanalysis
AT muhammadassam exploitingstackedautoencodersforimprovedsentimentanalysis
AT hebagmohamed exploitingstackedautoencodersforimprovedsentimentanalysis