A New Hybrid Based on Long Short-Term Memory Network with Spotted Hyena Optimization Algorithm for Multi-Label Text Classification
An essential work in natural language processing is the Multi-Label Text Classification (MLTC). The purpose of the MLTC is to assign multiple labels to each document. Traditional text classification methods, such as machine learning usually involve data scattering and failure to discover relationshi...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2022-02-01
|
Series: | Mathematics |
Subjects: | |
Online Access: | https://www.mdpi.com/2227-7390/10/3/488 |
_version_ | 1797486280102117376 |
---|---|
author | Hamed Khataei Maragheh Farhad Soleimanian Gharehchopogh Kambiz Majidzadeh Amin Babazadeh Sangar |
author_facet | Hamed Khataei Maragheh Farhad Soleimanian Gharehchopogh Kambiz Majidzadeh Amin Babazadeh Sangar |
author_sort | Hamed Khataei Maragheh |
collection | DOAJ |
description | An essential work in natural language processing is the Multi-Label Text Classification (MLTC). The purpose of the MLTC is to assign multiple labels to each document. Traditional text classification methods, such as machine learning usually involve data scattering and failure to discover relationships between data. With the development of deep learning algorithms, many authors have used deep learning in MLTC. In this paper, a novel model called Spotted Hyena Optimizer (SHO)-Long Short-Term Memory (SHO-LSTM) for MLTC based on LSTM network and SHO algorithm is proposed. In the LSTM network, the Skip-gram method is used to embed words into the vector space. The new model uses the SHO algorithm to optimize the initial weight of the LSTM network. Adjusting the weight matrix in LSTM is a major challenge. If the weight of the neurons to be accurate, then the accuracy of the output will be higher. The SHO algorithm is a population-based meta-heuristic algorithm that works based on the mass hunting behavior of spotted hyenas. In this algorithm, each solution of the problem is coded as a hyena. Then the hyenas are approached to the optimal answer by following the hyena of the leader. Four datasets are used (RCV1-v2, EUR-Lex, Reuters-21578, and Bookmarks) to evaluate the proposed model. The assessments demonstrate that the proposed model has a higher accuracy rate than LSTM, Genetic Algorithm-LSTM (GA-LSTM), Particle Swarm Optimization-LSTM (PSO-LSTM), Artificial Bee Colony-LSTM (ABC-LSTM), Harmony Algorithm Search-LSTM (HAS-LSTM), and Differential Evolution-LSTM (DE-LSTM). The improvement of SHO-LSTM model accuracy for four datasets compared to LSTM is 7.52%, 7.12%, 1.92%, and 4.90%, respectively. |
first_indexed | 2024-03-09T23:31:01Z |
format | Article |
id | doaj.art-31e59f5ea76c42eb95b38a84e6eaf9b1 |
institution | Directory Open Access Journal |
issn | 2227-7390 |
language | English |
last_indexed | 2024-03-09T23:31:01Z |
publishDate | 2022-02-01 |
publisher | MDPI AG |
record_format | Article |
series | Mathematics |
spelling | doaj.art-31e59f5ea76c42eb95b38a84e6eaf9b12023-11-23T17:08:20ZengMDPI AGMathematics2227-73902022-02-0110348810.3390/math10030488A New Hybrid Based on Long Short-Term Memory Network with Spotted Hyena Optimization Algorithm for Multi-Label Text ClassificationHamed Khataei Maragheh0Farhad Soleimanian Gharehchopogh1Kambiz Majidzadeh2Amin Babazadeh Sangar3Department of Computer Engineering, Urmia Branch, Islamic Azad University, Urmia 5756151818, IranDepartment of Computer Engineering, Urmia Branch, Islamic Azad University, Urmia 5756151818, IranDepartment of Computer Engineering, Urmia Branch, Islamic Azad University, Urmia 5756151818, IranDepartment of Computer Engineering, Urmia Branch, Islamic Azad University, Urmia 5756151818, IranAn essential work in natural language processing is the Multi-Label Text Classification (MLTC). The purpose of the MLTC is to assign multiple labels to each document. Traditional text classification methods, such as machine learning usually involve data scattering and failure to discover relationships between data. With the development of deep learning algorithms, many authors have used deep learning in MLTC. In this paper, a novel model called Spotted Hyena Optimizer (SHO)-Long Short-Term Memory (SHO-LSTM) for MLTC based on LSTM network and SHO algorithm is proposed. In the LSTM network, the Skip-gram method is used to embed words into the vector space. The new model uses the SHO algorithm to optimize the initial weight of the LSTM network. Adjusting the weight matrix in LSTM is a major challenge. If the weight of the neurons to be accurate, then the accuracy of the output will be higher. The SHO algorithm is a population-based meta-heuristic algorithm that works based on the mass hunting behavior of spotted hyenas. In this algorithm, each solution of the problem is coded as a hyena. Then the hyenas are approached to the optimal answer by following the hyena of the leader. Four datasets are used (RCV1-v2, EUR-Lex, Reuters-21578, and Bookmarks) to evaluate the proposed model. The assessments demonstrate that the proposed model has a higher accuracy rate than LSTM, Genetic Algorithm-LSTM (GA-LSTM), Particle Swarm Optimization-LSTM (PSO-LSTM), Artificial Bee Colony-LSTM (ABC-LSTM), Harmony Algorithm Search-LSTM (HAS-LSTM), and Differential Evolution-LSTM (DE-LSTM). The improvement of SHO-LSTM model accuracy for four datasets compared to LSTM is 7.52%, 7.12%, 1.92%, and 4.90%, respectively.https://www.mdpi.com/2227-7390/10/3/488multi-label text classificationdeep learning neural networksshort-term long-term memoryspotted hyena optimizer |
spellingShingle | Hamed Khataei Maragheh Farhad Soleimanian Gharehchopogh Kambiz Majidzadeh Amin Babazadeh Sangar A New Hybrid Based on Long Short-Term Memory Network with Spotted Hyena Optimization Algorithm for Multi-Label Text Classification Mathematics multi-label text classification deep learning neural networks short-term long-term memory spotted hyena optimizer |
title | A New Hybrid Based on Long Short-Term Memory Network with Spotted Hyena Optimization Algorithm for Multi-Label Text Classification |
title_full | A New Hybrid Based on Long Short-Term Memory Network with Spotted Hyena Optimization Algorithm for Multi-Label Text Classification |
title_fullStr | A New Hybrid Based on Long Short-Term Memory Network with Spotted Hyena Optimization Algorithm for Multi-Label Text Classification |
title_full_unstemmed | A New Hybrid Based on Long Short-Term Memory Network with Spotted Hyena Optimization Algorithm for Multi-Label Text Classification |
title_short | A New Hybrid Based on Long Short-Term Memory Network with Spotted Hyena Optimization Algorithm for Multi-Label Text Classification |
title_sort | new hybrid based on long short term memory network with spotted hyena optimization algorithm for multi label text classification |
topic | multi-label text classification deep learning neural networks short-term long-term memory spotted hyena optimizer |
url | https://www.mdpi.com/2227-7390/10/3/488 |
work_keys_str_mv | AT hamedkhataeimaragheh anewhybridbasedonlongshorttermmemorynetworkwithspottedhyenaoptimizationalgorithmformultilabeltextclassification AT farhadsoleimaniangharehchopogh anewhybridbasedonlongshorttermmemorynetworkwithspottedhyenaoptimizationalgorithmformultilabeltextclassification AT kambizmajidzadeh anewhybridbasedonlongshorttermmemorynetworkwithspottedhyenaoptimizationalgorithmformultilabeltextclassification AT aminbabazadehsangar anewhybridbasedonlongshorttermmemorynetworkwithspottedhyenaoptimizationalgorithmformultilabeltextclassification AT hamedkhataeimaragheh newhybridbasedonlongshorttermmemorynetworkwithspottedhyenaoptimizationalgorithmformultilabeltextclassification AT farhadsoleimaniangharehchopogh newhybridbasedonlongshorttermmemorynetworkwithspottedhyenaoptimizationalgorithmformultilabeltextclassification AT kambizmajidzadeh newhybridbasedonlongshorttermmemorynetworkwithspottedhyenaoptimizationalgorithmformultilabeltextclassification AT aminbabazadehsangar newhybridbasedonlongshorttermmemorynetworkwithspottedhyenaoptimizationalgorithmformultilabeltextclassification |