Attention-Based RU-BiLSTM Sentiment Analysis Model for Roman Urdu

Deep neural networks have emerged as a leading approach towards handling many natural language processing (NLP) tasks. Deep networks initially conquered the problems of computer vision. However, dealing with sequential data such as text and sound was a nightmare for such networks as traditional deep...

Full description

Bibliographic Details
Main Authors: Bilal Ahmed Chandio, Ali Shariq Imran, Maheen Bakhtyar, Sher Muhammad Daudpota, Junaid Baber
Format: Article
Language:English
Published: MDPI AG 2022-04-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/12/7/3641
_version_ 1827624261485854720
author Bilal Ahmed Chandio
Ali Shariq Imran
Maheen Bakhtyar
Sher Muhammad Daudpota
Junaid Baber
author_facet Bilal Ahmed Chandio
Ali Shariq Imran
Maheen Bakhtyar
Sher Muhammad Daudpota
Junaid Baber
author_sort Bilal Ahmed Chandio
collection DOAJ
description Deep neural networks have emerged as a leading approach towards handling many natural language processing (NLP) tasks. Deep networks initially conquered the problems of computer vision. However, dealing with sequential data such as text and sound was a nightmare for such networks as traditional deep networks are not reliable in preserving contextual information. This may not harm the results in the case of image processing where we do not care about the sequence, but when we consider the data collected from text for processing, such networks may trigger disastrous results. Moreover, establishing sentence semantics in a colloquial text such as Roman Urdu is a challenge. Additionally, the sparsity and high dimensionality of data in such informal text have encountered a significant challenge for building sentence semantics. To overcome this problem, we propose a deep recurrent architecture RU-BiLSTM based on bidirectional LSTM (BiLSTM) coupled with word embedding and an attention mechanism for sentiment analysis of Roman Urdu. Our proposed model uses the bidirectional LSTM to preserve the context in both directions and the attention mechanism to concentrate on more important features. Eventually, the last dense softmax output layer is used to acquire the binary and ternary classification results. We empirically evaluated our model on two available datasets of Roman Urdu, i.e., RUECD and RUSA-19. Our proposed model outperformed the baseline models on many grounds, and a significant improvement of 6% to 8% is achieved over baseline models.
first_indexed 2024-03-09T12:05:18Z
format Article
id doaj.art-0714fce452e748dea3b95cf9f63445e5
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-09T12:05:18Z
publishDate 2022-04-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-0714fce452e748dea3b95cf9f63445e52023-11-30T22:58:32ZengMDPI AGApplied Sciences2076-34172022-04-01127364110.3390/app12073641Attention-Based RU-BiLSTM Sentiment Analysis Model for Roman UrduBilal Ahmed Chandio0Ali Shariq Imran1Maheen Bakhtyar2Sher Muhammad Daudpota3Junaid Baber4Department of Computer Science and Information Technology, University of Balochistan, Quetta 87300, PakistanDepartment of Computer Science (IDI), Norwegian University of Science and Technology (NTNU), 2815 Gjøvik, NorwayDepartment of Computer Science and Information Technology, University of Balochistan, Quetta 87300, PakistanDepartment of Computer Science, Sukkur IBA University, Sukkur 65200, PakistanDepartment of Computer Science and Information Technology, University of Balochistan, Quetta 87300, PakistanDeep neural networks have emerged as a leading approach towards handling many natural language processing (NLP) tasks. Deep networks initially conquered the problems of computer vision. However, dealing with sequential data such as text and sound was a nightmare for such networks as traditional deep networks are not reliable in preserving contextual information. This may not harm the results in the case of image processing where we do not care about the sequence, but when we consider the data collected from text for processing, such networks may trigger disastrous results. Moreover, establishing sentence semantics in a colloquial text such as Roman Urdu is a challenge. Additionally, the sparsity and high dimensionality of data in such informal text have encountered a significant challenge for building sentence semantics. To overcome this problem, we propose a deep recurrent architecture RU-BiLSTM based on bidirectional LSTM (BiLSTM) coupled with word embedding and an attention mechanism for sentiment analysis of Roman Urdu. Our proposed model uses the bidirectional LSTM to preserve the context in both directions and the attention mechanism to concentrate on more important features. Eventually, the last dense softmax output layer is used to acquire the binary and ternary classification results. We empirically evaluated our model on two available datasets of Roman Urdu, i.e., RUECD and RUSA-19. Our proposed model outperformed the baseline models on many grounds, and a significant improvement of 6% to 8% is achieved over baseline models.https://www.mdpi.com/2076-3417/12/7/3641sentiment analysisdeep learningRoman Urduneural networksLSTMattention networks
spellingShingle Bilal Ahmed Chandio
Ali Shariq Imran
Maheen Bakhtyar
Sher Muhammad Daudpota
Junaid Baber
Attention-Based RU-BiLSTM Sentiment Analysis Model for Roman Urdu
Applied Sciences
sentiment analysis
deep learning
Roman Urdu
neural networks
LSTM
attention networks
title Attention-Based RU-BiLSTM Sentiment Analysis Model for Roman Urdu
title_full Attention-Based RU-BiLSTM Sentiment Analysis Model for Roman Urdu
title_fullStr Attention-Based RU-BiLSTM Sentiment Analysis Model for Roman Urdu
title_full_unstemmed Attention-Based RU-BiLSTM Sentiment Analysis Model for Roman Urdu
title_short Attention-Based RU-BiLSTM Sentiment Analysis Model for Roman Urdu
title_sort attention based ru bilstm sentiment analysis model for roman urdu
topic sentiment analysis
deep learning
Roman Urdu
neural networks
LSTM
attention networks
url https://www.mdpi.com/2076-3417/12/7/3641
work_keys_str_mv AT bilalahmedchandio attentionbasedrubilstmsentimentanalysismodelforromanurdu
AT alishariqimran attentionbasedrubilstmsentimentanalysismodelforromanurdu
AT maheenbakhtyar attentionbasedrubilstmsentimentanalysismodelforromanurdu
AT shermuhammaddaudpota attentionbasedrubilstmsentimentanalysismodelforromanurdu
AT junaidbaber attentionbasedrubilstmsentimentanalysismodelforromanurdu