An integrated text-based hybrid RNN-CNN model for toxic comment classification.

Forum and social media especially Facebook, Instagram and Twitter play a vital role in the communication and become the pivotal role in expressing feelings and opinions toward certain posts and news through leaving a comment and review. The comment and review can be positive, neutral, or negative. T...

Full description

Bibliographic Details
Main Authors: Wee, Tan Chi, Kit, Chaw Jun, Yiqi, Tew, Lang, Wong Siaw, Tan, Gloria Jennis, Mohd. Suaib, Norhaida
Format: Conference or Workshop Item
Published: 2023
Subjects:
_version_ 1811132523713921024
author Wee, Tan Chi
Kit, Chaw Jun
Yiqi, Tew
Lang, Wong Siaw
Tan, Gloria Jennis
Mohd. Suaib, Norhaida
author_facet Wee, Tan Chi
Kit, Chaw Jun
Yiqi, Tew
Lang, Wong Siaw
Tan, Gloria Jennis
Mohd. Suaib, Norhaida
author_sort Wee, Tan Chi
collection ePrints
description Forum and social media especially Facebook, Instagram and Twitter play a vital role in the communication and become the pivotal role in expressing feelings and opinions toward certain posts and news through leaving a comment and review. The comment and review can be positive, neutral, or negative. There have been several longitudinal studies involving online comments showed that one of five comments, 20 percent of comments are toxic comments. Toxic comments are those negative comments that can greatly affect a person emotionally which might cause people to suffer from mental problems. Hence, toxic comments detection is fast becoming a key instrument in protecting social media platforms' users from cyberbullying. This paper seeks to remedy this issue by proposing and comparing different artificial intelligence models to classify the toxic online comments. Thus, six different models with various algorithms which are CNN, RNN, Hybrid-NN, Linear SVC, multinomial naive bayes and logistic regression will be trained with a pre-processed dataset and evaluated with cross validation and ROC AUC. These models are then compared by observing the accuracy of each model and result shows that Hybrid-Neural network, combination between RNN and CNN score the top of the chart with accuracy of 0.9778 which suggest the best model to classify the online comments.
first_indexed 2024-09-24T00:05:39Z
format Conference or Workshop Item
id utm.eprints-107363
institution Universiti Teknologi Malaysia - ePrints
last_indexed 2024-09-24T00:05:39Z
publishDate 2023
record_format dspace
spelling utm.eprints-1073632024-09-03T06:26:45Z http://eprints.utm.my/107363/ An integrated text-based hybrid RNN-CNN model for toxic comment classification. Wee, Tan Chi Kit, Chaw Jun Yiqi, Tew Lang, Wong Siaw Tan, Gloria Jennis Mohd. Suaib, Norhaida T58.6-58.62 Management information systems Forum and social media especially Facebook, Instagram and Twitter play a vital role in the communication and become the pivotal role in expressing feelings and opinions toward certain posts and news through leaving a comment and review. The comment and review can be positive, neutral, or negative. There have been several longitudinal studies involving online comments showed that one of five comments, 20 percent of comments are toxic comments. Toxic comments are those negative comments that can greatly affect a person emotionally which might cause people to suffer from mental problems. Hence, toxic comments detection is fast becoming a key instrument in protecting social media platforms' users from cyberbullying. This paper seeks to remedy this issue by proposing and comparing different artificial intelligence models to classify the toxic online comments. Thus, six different models with various algorithms which are CNN, RNN, Hybrid-NN, Linear SVC, multinomial naive bayes and logistic regression will be trained with a pre-processed dataset and evaluated with cross validation and ROC AUC. These models are then compared by observing the accuracy of each model and result shows that Hybrid-Neural network, combination between RNN and CNN score the top of the chart with accuracy of 0.9778 which suggest the best model to classify the online comments. 2023-12-07 Conference or Workshop Item PeerReviewed Wee, Tan Chi and Kit, Chaw Jun and Yiqi, Tew and Lang, Wong Siaw and Tan, Gloria Jennis and Mohd. Suaib, Norhaida (2023) An integrated text-based hybrid RNN-CNN model for toxic comment classification. In: 4th Tarumanagara International Conference of the Applications of Technology and Engineering, TICATE 2021, 5 August 2021 - 6 August 2021, Jakarta, Indonesia - virtual, Online. http://dx.doi.org/10.1063/5.0125995
spellingShingle T58.6-58.62 Management information systems
Wee, Tan Chi
Kit, Chaw Jun
Yiqi, Tew
Lang, Wong Siaw
Tan, Gloria Jennis
Mohd. Suaib, Norhaida
An integrated text-based hybrid RNN-CNN model for toxic comment classification.
title An integrated text-based hybrid RNN-CNN model for toxic comment classification.
title_full An integrated text-based hybrid RNN-CNN model for toxic comment classification.
title_fullStr An integrated text-based hybrid RNN-CNN model for toxic comment classification.
title_full_unstemmed An integrated text-based hybrid RNN-CNN model for toxic comment classification.
title_short An integrated text-based hybrid RNN-CNN model for toxic comment classification.
title_sort integrated text based hybrid rnn cnn model for toxic comment classification
topic T58.6-58.62 Management information systems
work_keys_str_mv AT weetanchi anintegratedtextbasedhybridrnncnnmodelfortoxiccommentclassification
AT kitchawjun anintegratedtextbasedhybridrnncnnmodelfortoxiccommentclassification
AT yiqitew anintegratedtextbasedhybridrnncnnmodelfortoxiccommentclassification
AT langwongsiaw anintegratedtextbasedhybridrnncnnmodelfortoxiccommentclassification
AT tangloriajennis anintegratedtextbasedhybridrnncnnmodelfortoxiccommentclassification
AT mohdsuaibnorhaida anintegratedtextbasedhybridrnncnnmodelfortoxiccommentclassification
AT weetanchi integratedtextbasedhybridrnncnnmodelfortoxiccommentclassification
AT kitchawjun integratedtextbasedhybridrnncnnmodelfortoxiccommentclassification
AT yiqitew integratedtextbasedhybridrnncnnmodelfortoxiccommentclassification
AT langwongsiaw integratedtextbasedhybridrnncnnmodelfortoxiccommentclassification
AT tangloriajennis integratedtextbasedhybridrnncnnmodelfortoxiccommentclassification
AT mohdsuaibnorhaida integratedtextbasedhybridrnncnnmodelfortoxiccommentclassification