A Meta-heuristic Algorithm for the Minimal High-Quality Feature Extraction of Online Reviews

Feature extraction and selection are critical in sentiment analysis (SA) to extract and select only the appropriate features by removing those deemed redundant. As such, the successful implementation of this process leads to better classification accuracy. Inevitably, selecting high-quality minimal...

Full description

Bibliographic Details
Main Authors: Harnani Mat Zin, Norwati Mustapha, Masrah Azrifah Azmi Murad, Nurfadhlina Mohd Sharef
Format: Article
Language:English
Published: UUM Press 2022-10-01
Series:Journal of ICT
Subjects:
Online Access:https://e-journal.uum.edu.my/index.php/jict/article/view/14428
_version_ 1811341236525596672
author Harnani Mat Zin
Norwati Mustapha
Masrah Azrifah Azmi Murad
Nurfadhlina Mohd Sharef
author_facet Harnani Mat Zin
Norwati Mustapha
Masrah Azrifah Azmi Murad
Nurfadhlina Mohd Sharef
author_sort Harnani Mat Zin
collection DOAJ
description Feature extraction and selection are critical in sentiment analysis (SA) to extract and select only the appropriate features by removing those deemed redundant. As such, the successful implementation of this process leads to better classification accuracy. Inevitably, selecting high-quality minimal features can be challenging given the inherent complication in dealing with over-fitting issues. Most of the current studies used a heuristic method to perform the classification process that will result in selecting and examining only a single feature subset, while ignoring the other subsets that might give better results. This study explored the effect of using the meta-heuristic method together with the ensemble classification method in the sentiment classification of online reviews. Adding to that point, the extraction and selection of relevant features used feature ranking, hyper-parameter optimization, crossover, and mutation, while the classification process utilized the ensemble classifier. The proposed method was tested on the polarity movie review dataset v2.0 and product review dataset (books, electronics, kitchen, and music). The test results indicated that the proposed method significantly improved the classification results by 94%, which far exceeded the existing method. Therefore, the proposed feature extraction and selection method can help in improving the performance of SA in online reviews and, at the same time, reduce the number of extracted features. 
first_indexed 2024-04-13T18:53:33Z
format Article
id doaj.art-57e28814d4c94924bea005ffaf005106
institution Directory Open Access Journal
issn 1675-414X
2180-3862
language English
last_indexed 2024-04-13T18:53:33Z
publishDate 2022-10-01
publisher UUM Press
record_format Article
series Journal of ICT
spelling doaj.art-57e28814d4c94924bea005ffaf0051062022-12-22T02:34:19ZengUUM PressJournal of ICT1675-414X2180-38622022-10-0121410.32890/jict2022.21.4.5A Meta-heuristic Algorithm for the Minimal High-Quality Feature Extraction of Online ReviewsHarnani Mat Zin0Norwati Mustapha1Masrah Azrifah Azmi Murad2Nurfadhlina Mohd Sharef3Computing Department, Faculty of Computing, Arts & Creative Industry, Universiti Pendidikan Sultan Idris, MalaysiaDepartment of Computer Science, Faculty of Computer Science and Information Technology, Universiti Putra Malaysia, MalaysiaDepartment of Computer Science, Faculty of Computer Science and Information Technology, Universiti Putra Malaysia, MalaysiaDepartment of Computer Science, Faculty of Computer Science and Information Technology, Universiti Putra Malaysia, Malaysia Feature extraction and selection are critical in sentiment analysis (SA) to extract and select only the appropriate features by removing those deemed redundant. As such, the successful implementation of this process leads to better classification accuracy. Inevitably, selecting high-quality minimal features can be challenging given the inherent complication in dealing with over-fitting issues. Most of the current studies used a heuristic method to perform the classification process that will result in selecting and examining only a single feature subset, while ignoring the other subsets that might give better results. This study explored the effect of using the meta-heuristic method together with the ensemble classification method in the sentiment classification of online reviews. Adding to that point, the extraction and selection of relevant features used feature ranking, hyper-parameter optimization, crossover, and mutation, while the classification process utilized the ensemble classifier. The proposed method was tested on the polarity movie review dataset v2.0 and product review dataset (books, electronics, kitchen, and music). The test results indicated that the proposed method significantly improved the classification results by 94%, which far exceeded the existing method. Therefore, the proposed feature extraction and selection method can help in improving the performance of SA in online reviews and, at the same time, reduce the number of extracted features.  https://e-journal.uum.edu.my/index.php/jict/article/view/14428Feature extractionfeature selectiononline reviewsmeta-heuristicssentiment analysis
spellingShingle Harnani Mat Zin
Norwati Mustapha
Masrah Azrifah Azmi Murad
Nurfadhlina Mohd Sharef
A Meta-heuristic Algorithm for the Minimal High-Quality Feature Extraction of Online Reviews
Journal of ICT
Feature extraction
feature selection
online reviews
meta-heuristics
sentiment analysis
title A Meta-heuristic Algorithm for the Minimal High-Quality Feature Extraction of Online Reviews
title_full A Meta-heuristic Algorithm for the Minimal High-Quality Feature Extraction of Online Reviews
title_fullStr A Meta-heuristic Algorithm for the Minimal High-Quality Feature Extraction of Online Reviews
title_full_unstemmed A Meta-heuristic Algorithm for the Minimal High-Quality Feature Extraction of Online Reviews
title_short A Meta-heuristic Algorithm for the Minimal High-Quality Feature Extraction of Online Reviews
title_sort meta heuristic algorithm for the minimal high quality feature extraction of online reviews
topic Feature extraction
feature selection
online reviews
meta-heuristics
sentiment analysis
url https://e-journal.uum.edu.my/index.php/jict/article/view/14428
work_keys_str_mv AT harnanimatzin ametaheuristicalgorithmfortheminimalhighqualityfeatureextractionofonlinereviews
AT norwatimustapha ametaheuristicalgorithmfortheminimalhighqualityfeatureextractionofonlinereviews
AT masrahazrifahazmimurad ametaheuristicalgorithmfortheminimalhighqualityfeatureextractionofonlinereviews
AT nurfadhlinamohdsharef ametaheuristicalgorithmfortheminimalhighqualityfeatureextractionofonlinereviews
AT harnanimatzin metaheuristicalgorithmfortheminimalhighqualityfeatureextractionofonlinereviews
AT norwatimustapha metaheuristicalgorithmfortheminimalhighqualityfeatureextractionofonlinereviews
AT masrahazrifahazmimurad metaheuristicalgorithmfortheminimalhighqualityfeatureextractionofonlinereviews
AT nurfadhlinamohdsharef metaheuristicalgorithmfortheminimalhighqualityfeatureextractionofonlinereviews