Text Classification Based on Neural Network Fusion

The goal of text classification is to identify the category to which the text belongs. Text categorization is widely used in email detection, sentiment analysis, topic marking and other fields. However, good text representation is the point to improve the capability of NLP tasks. Traditional text re...

Full description

Bibliographic Details
Main Author: Deageon Kim
Format: Article
Language:English
Published: University North 2023-01-01
Series:Tehnički Glasnik
Subjects:
Online Access:https://hrcak.srce.hr/file/441847
_version_ 1797206415082782720
author Deageon Kim
author_facet Deageon Kim
author_sort Deageon Kim
collection DOAJ
description The goal of text classification is to identify the category to which the text belongs. Text categorization is widely used in email detection, sentiment analysis, topic marking and other fields. However, good text representation is the point to improve the capability of NLP tasks. Traditional text representation adopts bag-of-words model or vector space model, which loses the context information of the text and faces the problems of high latitude and high sparsity,. In recent years, with the increase of data and the improvement of computing performance, the use of deep learning technology to represent and classify texts has attracted great attention. Convolutional Neural Network (CNN), Recurrent Neural Network (RNN) and RNN with attention mechanism are used to represent the text, and then to classify the text and other NLP tasks, all of which have better performance than the traditional methods. In this paper, we design two sentence-level models based on the deep network and the details are as follows: (1) Text representation and classification model based on bidirectional RNN and CNN (BRCNN). BRCNN’s input is the word vector corresponding to each word in the sentence; after using RNN to extract word order information in sentences, CNN is used to extract higher-level features of sentences. After convolution, the maximum pool operation is used to obtain sentence vectors. At last, softmax classifier is used for classification. RNN can capture the word order information in sentences, while CNN can extract useful features. Experiments on eight text classification tasks show that BRCNN model can get better text feature representation, and the classification accuracy rate is equal to or higher than that of the prior art. (2) Attention mechanism and CNN (ACNN) model uses the RNN with attention mechanism to obtain the context vector; Then CNN is used to extract more advanced feature information. The maximum pool operation is adopted to obtain a sentence vector; At last, the softmax classifier is used to classify the text. Experiments on eight text classification benchmark data sets show that ACNN improves the stability of model convergence, and can converge to an optimal or local optimal solution better than BRCNN.
first_indexed 2024-04-24T09:06:39Z
format Article
id doaj.art-df697b58967a4e1b9efda548c8ee9928
institution Directory Open Access Journal
issn 1846-6168
1848-5588
language English
last_indexed 2024-04-24T09:06:39Z
publishDate 2023-01-01
publisher University North
record_format Article
series Tehnički Glasnik
spelling doaj.art-df697b58967a4e1b9efda548c8ee99282024-04-15T18:46:14ZengUniversity NorthTehnički Glasnik1846-61681848-55882023-01-0117335936610.31803/tg-20221228154330Text Classification Based on Neural Network FusionDeageon Kim0Architectural Engineering, Dongseo University, 47 Jurye-ro, Sasang-gu, Busan, 47011, Republic of KoreaThe goal of text classification is to identify the category to which the text belongs. Text categorization is widely used in email detection, sentiment analysis, topic marking and other fields. However, good text representation is the point to improve the capability of NLP tasks. Traditional text representation adopts bag-of-words model or vector space model, which loses the context information of the text and faces the problems of high latitude and high sparsity,. In recent years, with the increase of data and the improvement of computing performance, the use of deep learning technology to represent and classify texts has attracted great attention. Convolutional Neural Network (CNN), Recurrent Neural Network (RNN) and RNN with attention mechanism are used to represent the text, and then to classify the text and other NLP tasks, all of which have better performance than the traditional methods. In this paper, we design two sentence-level models based on the deep network and the details are as follows: (1) Text representation and classification model based on bidirectional RNN and CNN (BRCNN). BRCNN’s input is the word vector corresponding to each word in the sentence; after using RNN to extract word order information in sentences, CNN is used to extract higher-level features of sentences. After convolution, the maximum pool operation is used to obtain sentence vectors. At last, softmax classifier is used for classification. RNN can capture the word order information in sentences, while CNN can extract useful features. Experiments on eight text classification tasks show that BRCNN model can get better text feature representation, and the classification accuracy rate is equal to or higher than that of the prior art. (2) Attention mechanism and CNN (ACNN) model uses the RNN with attention mechanism to obtain the context vector; Then CNN is used to extract more advanced feature information. The maximum pool operation is adopted to obtain a sentence vector; At last, the softmax classifier is used to classify the text. Experiments on eight text classification benchmark data sets show that ACNN improves the stability of model convergence, and can converge to an optimal or local optimal solution better than BRCNN.https://hrcak.srce.hr/file/441847attention mechanismdeep learningneural networkpresentationtext classification
spellingShingle Deageon Kim
Text Classification Based on Neural Network Fusion
Tehnički Glasnik
attention mechanism
deep learning
neural network
presentation
text classification
title Text Classification Based on Neural Network Fusion
title_full Text Classification Based on Neural Network Fusion
title_fullStr Text Classification Based on Neural Network Fusion
title_full_unstemmed Text Classification Based on Neural Network Fusion
title_short Text Classification Based on Neural Network Fusion
title_sort text classification based on neural network fusion
topic attention mechanism
deep learning
neural network
presentation
text classification
url https://hrcak.srce.hr/file/441847
work_keys_str_mv AT deageonkim textclassificationbasedonneuralnetworkfusion