Enhancing online safety: leveraging large language models for community moderation in Singlish dialect

Bibliographic Details
Main Author: Goh, Zheng Ying
Other Authors: Dusit Niyato
Format: Final Year Project (FYP)
Language: English
Published: Nanyang Technological University 2024
Subjects: Computer and Information Science; Transformers; Large language model
Online Access: https://hdl.handle.net/10356/174224
author Goh, Zheng Ying
author2 Dusit Niyato
collection NTU
description Online forums and comment sections are now ubiquitous on social media platforms, giving users the freedom to share their thoughts and opinions openly. That freedom carries a risk: users may abuse it by posting toxic or harmful comments. Anonymity and the lack of immediate consequences often embolden individuals to behave disrespectfully or offensively, which contributes to the proliferation of toxic content. In recent years, interest in transformers has surged, driven largely by the success of models such as ChatGPT. Transformer-based models have demonstrated strong capabilities in natural language processing tasks, including text generation and comprehension, and researchers and practitioners have increasingly turned to them for language processing challenges such as content moderation. This project aims to apply transformer technology to the problem of toxic content in online forums, with a particular focus on Singlish forums. Singlish is a variety of English spoken in Singapore, characterized by its distinctive vocabulary, grammar, and intonation. The project seeks to fine-tune a pre-trained BERT model for toxic comment detection, developed by Unitary, for content moderation of Singlish forums using Parameter-Efficient Fine-Tuning (PEFT) methods.
first_indexed 2024-10-01T05:51:19Z
format Final Year Project (FYP)
id ntu-10356/174224
institution Nanyang Technological University
language English
last_indexed 2024-10-01T05:51:19Z
publishDate 2024
publisher Nanyang Technological University
record_format dspace
spelling ntu-10356/174224; 2024-03-22T15:37:52Z; Enhancing online safety: leveraging large language models for community moderation in Singlish dialect; Goh, Zheng Ying; Dusit Niyato; School of Computer Science and Engineering; DNIYATO@ntu.edu.sg; Computer and Information Science; Transformers; Large language model; Bachelor's degree; 2024-03-21T22:55:39Z; 2024; Final Year Project (FYP); Goh, Z. Y. (2024). Enhancing online safety: leveraging large language models for community moderation in Singlish dialect. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/174224; en; SCSE23-0682; application/pdf; Nanyang Technological University
title Enhancing online safety: leveraging large language models for community moderation in Singlish dialect
topic Computer and Information Science
Transformers
Large language model
url https://hdl.handle.net/10356/174224
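
The description mentions Parameter-Efficient Fine-Tuning (PEFT) without naming a specific method. As a rough illustration only, the sketch below uses LoRA, one common PEFT technique, to show why such methods train far fewer weights than full fine-tuning. LoRA itself, the function names, and the 768 x 768 layer size are assumptions made for this example and are not taken from the project.

```python
# Illustrative sketch: LoRA is assumed here as an example PEFT method.
# The record does not state which PEFT method the project used.
# All function names and sizes are hypothetical.

def full_params(d_out: int, d_in: int) -> int:
    """Trainable weights when the whole d_out x d_in matrix is updated."""
    return d_out * d_in


def lora_params(d_out: int, d_in: int, rank: int) -> int:
    """Trainable weights for a low-rank update B (d_out x rank) @ A (rank x d_in)."""
    return rank * (d_out + d_in)


def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_weight(w, b, a, scale=1.0):
    """Effective weight W + scale * (B @ A); W stays frozen, only B and A are trained."""
    delta = matmul(b, a)
    return [[wij + scale * dij for wij, dij in zip(w_row, d_row)]
            for w_row, d_row in zip(w, delta)]


if __name__ == "__main__":
    # Assumed layer size for illustration: one 768 x 768 projection, rank 8.
    print(full_params(768, 768))     # 589824
    print(lora_params(768, 768, 8))  # 12288
    # Tiny worked example of the low-rank update on a 2 x 2 identity weight.
    print(lora_weight([[1, 0], [0, 1]], [[1], [2]], [[3, 4]]))
```

Under these assumed sizes, the low-rank update trains about 2% of the weights of the full matrix, which is the kind of saving PEFT methods are designed to give.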