Attention-Based RU-BiLSTM Sentiment Analysis Model for Roman Urdu
Deep neural networks have emerged as a leading approach towards handling many natural language processing (NLP) tasks. Deep networks initially conquered the problems of computer vision. However, dealing with sequential data such as text and sound was a nightmare for such networks as traditional deep...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2022-04-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/12/7/3641 |
_version_ | 1827624261485854720 |
---|---|
author | Bilal Ahmed Chandio Ali Shariq Imran Maheen Bakhtyar Sher Muhammad Daudpota Junaid Baber |
author_facet | Bilal Ahmed Chandio Ali Shariq Imran Maheen Bakhtyar Sher Muhammad Daudpota Junaid Baber |
author_sort | Bilal Ahmed Chandio |
collection | DOAJ |
description | Deep neural networks have emerged as a leading approach towards handling many natural language processing (NLP) tasks. Deep networks initially conquered the problems of computer vision. However, dealing with sequential data such as text and sound was a nightmare for such networks as traditional deep networks are not reliable in preserving contextual information. This may not harm the results in the case of image processing where we do not care about the sequence, but when we consider the data collected from text for processing, such networks may trigger disastrous results. Moreover, establishing sentence semantics in a colloquial text such as Roman Urdu is a challenge. Additionally, the sparsity and high dimensionality of data in such informal text have encountered a significant challenge for building sentence semantics. To overcome this problem, we propose a deep recurrent architecture RU-BiLSTM based on bidirectional LSTM (BiLSTM) coupled with word embedding and an attention mechanism for sentiment analysis of Roman Urdu. Our proposed model uses the bidirectional LSTM to preserve the context in both directions and the attention mechanism to concentrate on more important features. Eventually, the last dense softmax output layer is used to acquire the binary and ternary classification results. We empirically evaluated our model on two available datasets of Roman Urdu, i.e., RUECD and RUSA-19. Our proposed model outperformed the baseline models on many grounds, and a significant improvement of 6% to 8% is achieved over baseline models. |
first_indexed | 2024-03-09T12:05:18Z |
format | Article |
id | doaj.art-0714fce452e748dea3b95cf9f63445e5 |
institution | Directory Open Access Journal |
issn | 2076-3417 |
language | English |
last_indexed | 2024-03-09T12:05:18Z |
publishDate | 2022-04-01 |
publisher | MDPI AG |
record_format | Article |
series | Applied Sciences |
spelling | doaj.art-0714fce452e748dea3b95cf9f63445e52023-11-30T22:58:32ZengMDPI AGApplied Sciences2076-34172022-04-01127364110.3390/app12073641Attention-Based RU-BiLSTM Sentiment Analysis Model for Roman UrduBilal Ahmed Chandio0Ali Shariq Imran1Maheen Bakhtyar2Sher Muhammad Daudpota3Junaid Baber4Department of Computer Science and Information Technology, University of Balochistan, Quetta 87300, PakistanDepartment of Computer Science (IDI), Norwegian University of Science and Technology (NTNU), 2815 Gjøvik, NorwayDepartment of Computer Science and Information Technology, University of Balochistan, Quetta 87300, PakistanDepartment of Computer Science, Sukkur IBA University, Sukkur 65200, PakistanDepartment of Computer Science and Information Technology, University of Balochistan, Quetta 87300, PakistanDeep neural networks have emerged as a leading approach towards handling many natural language processing (NLP) tasks. Deep networks initially conquered the problems of computer vision. However, dealing with sequential data such as text and sound was a nightmare for such networks as traditional deep networks are not reliable in preserving contextual information. This may not harm the results in the case of image processing where we do not care about the sequence, but when we consider the data collected from text for processing, such networks may trigger disastrous results. Moreover, establishing sentence semantics in a colloquial text such as Roman Urdu is a challenge. Additionally, the sparsity and high dimensionality of data in such informal text have encountered a significant challenge for building sentence semantics. To overcome this problem, we propose a deep recurrent architecture RU-BiLSTM based on bidirectional LSTM (BiLSTM) coupled with word embedding and an attention mechanism for sentiment analysis of Roman Urdu. Our proposed model uses the bidirectional LSTM to preserve the context in both directions and the attention mechanism to concentrate on more important features. Eventually, the last dense softmax output layer is used to acquire the binary and ternary classification results. We empirically evaluated our model on two available datasets of Roman Urdu, i.e., RUECD and RUSA-19. Our proposed model outperformed the baseline models on many grounds, and a significant improvement of 6% to 8% is achieved over baseline models.https://www.mdpi.com/2076-3417/12/7/3641sentiment analysisdeep learningRoman Urduneural networksLSTMattention networks |
spellingShingle | Bilal Ahmed Chandio Ali Shariq Imran Maheen Bakhtyar Sher Muhammad Daudpota Junaid Baber Attention-Based RU-BiLSTM Sentiment Analysis Model for Roman Urdu Applied Sciences sentiment analysis deep learning Roman Urdu neural networks LSTM attention networks |
title | Attention-Based RU-BiLSTM Sentiment Analysis Model for Roman Urdu |
title_full | Attention-Based RU-BiLSTM Sentiment Analysis Model for Roman Urdu |
title_fullStr | Attention-Based RU-BiLSTM Sentiment Analysis Model for Roman Urdu |
title_full_unstemmed | Attention-Based RU-BiLSTM Sentiment Analysis Model for Roman Urdu |
title_short | Attention-Based RU-BiLSTM Sentiment Analysis Model for Roman Urdu |
title_sort | attention based ru bilstm sentiment analysis model for roman urdu |
topic | sentiment analysis deep learning Roman Urdu neural networks LSTM attention networks |
url | https://www.mdpi.com/2076-3417/12/7/3641 |
work_keys_str_mv | AT bilalahmedchandio attentionbasedrubilstmsentimentanalysismodelforromanurdu AT alishariqimran attentionbasedrubilstmsentimentanalysismodelforromanurdu AT maheenbakhtyar attentionbasedrubilstmsentimentanalysismodelforromanurdu AT shermuhammaddaudpota attentionbasedrubilstmsentimentanalysismodelforromanurdu AT junaidbaber attentionbasedrubilstmsentimentanalysismodelforromanurdu |