Development of a classification system on big data set using machine learning techniques

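Millions of people around the world regularly use online social media platforms such as Facebook, Twitter and Reddit. As a result, a huge amount of text data is available online, which presents a good opportunity to study and analyse the sentiment of texts. This project aims to create classification models, based on a Twitter dataset, that classify Tweets into one of three sentiment classes: positive, negative, or neutral. Seven different classification models were explored and tuned, obtaining accuracies ranging from 55% to 70%. A Telegram bot was built that outputs the sentiment of user inputs using the trained classification models. Using Twitter APIs to stream Tweets, a real-time graph was also created that shows the sentiment for a specified keyword over time.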


Bibliographic Details
Main Author: Tan, Zhi Ler
Other Authors: Chan Chee Keong
Format: Final Year Project (FYP)
Language: English
Published: Nanyang Technological University 2021
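Subjects: Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence; Engineering::Electrical and electronic engineering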
Online Access: https://hdl.handle.net/10356/149132
_version_ 1811682600673083392
author Tan, Zhi Ler
author2 Chan Chee Keong
author_facet Chan Chee Keong
Tan, Zhi Ler
author_sort Tan, Zhi Ler
collection NTU
description Millions of people around the world regularly use online social media platforms such as Facebook, Twitter and Reddit. As a result, a huge amount of text data is available online, which presents a good opportunity to study and analyse the sentiment of texts. This project aims to create classification models, based on a Twitter dataset, that classify Tweets into one of three sentiment classes: positive, negative, or neutral. Seven different classification models were explored and tuned, obtaining accuracies ranging from 55% to 70%. A Telegram bot was built that outputs the sentiment of user inputs using the trained classification models. Using Twitter APIs to stream Tweets, a real-time graph was also created that shows the sentiment for a specified keyword over time.
first_indexed 2024-10-01T03:59:25Z
format Final Year Project (FYP)
id ntu-10356/149132
institution Nanyang Technological University
language English
last_indexed 2024-10-01T03:59:25Z
publishDate 2021
publisher Nanyang Technological University
record_format dspace
spelling ntu-10356/149132 2023-07-07T17:41:07Z Development of a classification system on big data set using machine learning techniques Tan, Zhi Ler Chan Chee Keong School of Electrical and Electronic Engineering ECKCHAN@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Electrical and electronic engineering Millions of people around the world regularly use online social media platforms such as Facebook, Twitter and Reddit. As a result, a huge amount of text data is available online, which presents a good opportunity to study and analyse the sentiment of texts. This project aims to create classification models, based on a Twitter dataset, that classify Tweets into one of three sentiment classes: positive, negative, or neutral. Seven different classification models were explored and tuned, obtaining accuracies ranging from 55% to 70%. A Telegram bot was built that outputs the sentiment of user inputs using the trained classification models. Using Twitter APIs to stream Tweets, a real-time graph was also created that shows the sentiment for a specified keyword over time. Bachelor of Engineering (Information Engineering and Media) 2021-05-27T06:54:54Z 2021-05-27T06:54:54Z 2021 Final Year Project (FYP) Tan, Z. L. (2021). Development of a classification system on big data set using machine learning techniques. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/149132 https://hdl.handle.net/10356/149132 en application/pdf Nanyang Technological University
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Engineering::Electrical and electronic engineering
Tan, Zhi Ler
Development of a classification system on big data set using machine learning techniques
title Development of a classification system on big data set using machine learning techniques
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Engineering::Electrical and electronic engineering
url https://hdl.handle.net/10356/149132