A New Hybrid Based on Long Short-Term Memory Network with Spotted Hyena Optimization Algorithm for Multi-Label Text Classification

An essential work in natural language processing is the Multi-Label Text Classification (MLTC). The purpose of the MLTC is to assign multiple labels to each document. Traditional text classification methods, such as machine learning usually involve data scattering and failure to discover relationshi...

Full description

Bibliographic Details
Main Authors: Hamed Khataei Maragheh, Farhad Soleimanian Gharehchopogh, Kambiz Majidzadeh, Amin Babazadeh Sangar
Format: Article
Language:English
Published: MDPI AG 2022-02-01
Series:Mathematics
Subjects:
Online Access:https://www.mdpi.com/2227-7390/10/3/488
_version_ 1797486280102117376
author Hamed Khataei Maragheh
Farhad Soleimanian Gharehchopogh
Kambiz Majidzadeh
Amin Babazadeh Sangar
author_facet Hamed Khataei Maragheh
Farhad Soleimanian Gharehchopogh
Kambiz Majidzadeh
Amin Babazadeh Sangar
author_sort Hamed Khataei Maragheh
collection DOAJ
description An essential work in natural language processing is the Multi-Label Text Classification (MLTC). The purpose of the MLTC is to assign multiple labels to each document. Traditional text classification methods, such as machine learning usually involve data scattering and failure to discover relationships between data. With the development of deep learning algorithms, many authors have used deep learning in MLTC. In this paper, a novel model called Spotted Hyena Optimizer (SHO)-Long Short-Term Memory (SHO-LSTM) for MLTC based on LSTM network and SHO algorithm is proposed. In the LSTM network, the Skip-gram method is used to embed words into the vector space. The new model uses the SHO algorithm to optimize the initial weight of the LSTM network. Adjusting the weight matrix in LSTM is a major challenge. If the weight of the neurons to be accurate, then the accuracy of the output will be higher. The SHO algorithm is a population-based meta-heuristic algorithm that works based on the mass hunting behavior of spotted hyenas. In this algorithm, each solution of the problem is coded as a hyena. Then the hyenas are approached to the optimal answer by following the hyena of the leader. Four datasets are used (RCV1-v2, EUR-Lex, Reuters-21578, and Bookmarks) to evaluate the proposed model. The assessments demonstrate that the proposed model has a higher accuracy rate than LSTM, Genetic Algorithm-LSTM (GA-LSTM), Particle Swarm Optimization-LSTM (PSO-LSTM), Artificial Bee Colony-LSTM (ABC-LSTM), Harmony Algorithm Search-LSTM (HAS-LSTM), and Differential Evolution-LSTM (DE-LSTM). The improvement of SHO-LSTM model accuracy for four datasets compared to LSTM is 7.52%, 7.12%, 1.92%, and 4.90%, respectively.
first_indexed 2024-03-09T23:31:01Z
format Article
id doaj.art-31e59f5ea76c42eb95b38a84e6eaf9b1
institution Directory Open Access Journal
issn 2227-7390
language English
last_indexed 2024-03-09T23:31:01Z
publishDate 2022-02-01
publisher MDPI AG
record_format Article
series Mathematics
spelling doaj.art-31e59f5ea76c42eb95b38a84e6eaf9b12023-11-23T17:08:20ZengMDPI AGMathematics2227-73902022-02-0110348810.3390/math10030488A New Hybrid Based on Long Short-Term Memory Network with Spotted Hyena Optimization Algorithm for Multi-Label Text ClassificationHamed Khataei Maragheh0Farhad Soleimanian Gharehchopogh1Kambiz Majidzadeh2Amin Babazadeh Sangar3Department of Computer Engineering, Urmia Branch, Islamic Azad University, Urmia 5756151818, IranDepartment of Computer Engineering, Urmia Branch, Islamic Azad University, Urmia 5756151818, IranDepartment of Computer Engineering, Urmia Branch, Islamic Azad University, Urmia 5756151818, IranDepartment of Computer Engineering, Urmia Branch, Islamic Azad University, Urmia 5756151818, IranAn essential work in natural language processing is the Multi-Label Text Classification (MLTC). The purpose of the MLTC is to assign multiple labels to each document. Traditional text classification methods, such as machine learning usually involve data scattering and failure to discover relationships between data. With the development of deep learning algorithms, many authors have used deep learning in MLTC. In this paper, a novel model called Spotted Hyena Optimizer (SHO)-Long Short-Term Memory (SHO-LSTM) for MLTC based on LSTM network and SHO algorithm is proposed. In the LSTM network, the Skip-gram method is used to embed words into the vector space. The new model uses the SHO algorithm to optimize the initial weight of the LSTM network. Adjusting the weight matrix in LSTM is a major challenge. If the weight of the neurons to be accurate, then the accuracy of the output will be higher. The SHO algorithm is a population-based meta-heuristic algorithm that works based on the mass hunting behavior of spotted hyenas. In this algorithm, each solution of the problem is coded as a hyena. Then the hyenas are approached to the optimal answer by following the hyena of the leader. Four datasets are used (RCV1-v2, EUR-Lex, Reuters-21578, and Bookmarks) to evaluate the proposed model. The assessments demonstrate that the proposed model has a higher accuracy rate than LSTM, Genetic Algorithm-LSTM (GA-LSTM), Particle Swarm Optimization-LSTM (PSO-LSTM), Artificial Bee Colony-LSTM (ABC-LSTM), Harmony Algorithm Search-LSTM (HAS-LSTM), and Differential Evolution-LSTM (DE-LSTM). The improvement of SHO-LSTM model accuracy for four datasets compared to LSTM is 7.52%, 7.12%, 1.92%, and 4.90%, respectively.https://www.mdpi.com/2227-7390/10/3/488multi-label text classificationdeep learning neural networksshort-term long-term memoryspotted hyena optimizer
spellingShingle Hamed Khataei Maragheh
Farhad Soleimanian Gharehchopogh
Kambiz Majidzadeh
Amin Babazadeh Sangar
A New Hybrid Based on Long Short-Term Memory Network with Spotted Hyena Optimization Algorithm for Multi-Label Text Classification
Mathematics
multi-label text classification
deep learning neural networks
short-term long-term memory
spotted hyena optimizer
title A New Hybrid Based on Long Short-Term Memory Network with Spotted Hyena Optimization Algorithm for Multi-Label Text Classification
title_full A New Hybrid Based on Long Short-Term Memory Network with Spotted Hyena Optimization Algorithm for Multi-Label Text Classification
title_fullStr A New Hybrid Based on Long Short-Term Memory Network with Spotted Hyena Optimization Algorithm for Multi-Label Text Classification
title_full_unstemmed A New Hybrid Based on Long Short-Term Memory Network with Spotted Hyena Optimization Algorithm for Multi-Label Text Classification
title_short A New Hybrid Based on Long Short-Term Memory Network with Spotted Hyena Optimization Algorithm for Multi-Label Text Classification
title_sort new hybrid based on long short term memory network with spotted hyena optimization algorithm for multi label text classification
topic multi-label text classification
deep learning neural networks
short-term long-term memory
spotted hyena optimizer
url https://www.mdpi.com/2227-7390/10/3/488
work_keys_str_mv AT hamedkhataeimaragheh anewhybridbasedonlongshorttermmemorynetworkwithspottedhyenaoptimizationalgorithmformultilabeltextclassification
AT farhadsoleimaniangharehchopogh anewhybridbasedonlongshorttermmemorynetworkwithspottedhyenaoptimizationalgorithmformultilabeltextclassification
AT kambizmajidzadeh anewhybridbasedonlongshorttermmemorynetworkwithspottedhyenaoptimizationalgorithmformultilabeltextclassification
AT aminbabazadehsangar anewhybridbasedonlongshorttermmemorynetworkwithspottedhyenaoptimizationalgorithmformultilabeltextclassification
AT hamedkhataeimaragheh newhybridbasedonlongshorttermmemorynetworkwithspottedhyenaoptimizationalgorithmformultilabeltextclassification
AT farhadsoleimaniangharehchopogh newhybridbasedonlongshorttermmemorynetworkwithspottedhyenaoptimizationalgorithmformultilabeltextclassification
AT kambizmajidzadeh newhybridbasedonlongshorttermmemorynetworkwithspottedhyenaoptimizationalgorithmformultilabeltextclassification
AT aminbabazadehsangar newhybridbasedonlongshorttermmemorynetworkwithspottedhyenaoptimizationalgorithmformultilabeltextclassification