Text data mining approach for mental health prediction

Bibliographic Details
Main Author: Chan, Ian Jia Jun
Other Authors: Vidya Sudarshan
Format: Final Year Project (FYP)
Language: English
Published: Nanyang Technological University 2024
Subjects: Computer and Information Science; Engineering; Sentiment analysis; Mental health prediction; BERT
Online Access: https://hdl.handle.net/10356/175167
_version_ 1811696796499443712
author Chan, Ian Jia Jun
author2 Vidya Sudarshan
author_facet Vidya Sudarshan
Chan, Ian Jia Jun
author_sort Chan, Ian Jia Jun
collection NTU
description Depression is considered one of the most common mental disorders worldwide, with far-reaching negative impacts on those who suffer from it. Meanwhile, social media use has become widespread: around 60% of the global population and 92.7% of all internet users are social media users. As a result, users tend to express their emotions and thoughts through text on social media platforms. Advances in Artificial Intelligence, particularly Natural Language Processing (NLP), have made it possible to interpret text and the underlying emotion within it, a task known as Sentiment Analysis. With a suitable Sentiment Analysis model, detecting depressive text can help identify potential depression and raise awareness of mental health. This paper compares existing tools, namely Long Short-Term Memory (LSTM) and Bidirectional Encoder Representations from Transformers (BERT), for depression detection and examines the shortfalls of these models. Furthermore, using existing models as a base and modifying them, this paper aims to produce a better-performing model. The scope is limited to binary classification of depressive text rather than a wider range of classes, in the hope of producing models that are more accurate and robust on this binary task. This paper proposes the use of pre-trained BERT to outperform existing Sentiment Analysis models, and further proposes a novel modification to the BERT classifier that may improve performance. This paper examines the performance of these models with a wide range of metrics and aims to produce a model that can accurately detect underlying depression in textual data.
first_indexed 2024-10-01T07:45:03Z
format Final Year Project (FYP)
id ntu-10356/175167
institution Nanyang Technological University
language English
last_indexed 2024-10-01T07:45:03Z
publishDate 2024
publisher Nanyang Technological University
record_format dspace
spelling ntu-10356/175167 2024-04-26T15:41:20Z Text data mining approach for mental health prediction Chan, Ian Jia Jun Vidya Sudarshan School of Computer Science and Engineering vidya.sudarshan@ntu.edu.sg Computer and Information Science Engineering Sentiment analysis Mental health prediction BERT Depression is considered one of the most common mental disorders worldwide, with far-reaching negative impacts on those who suffer from it. Meanwhile, social media use has become widespread: around 60% of the global population and 92.7% of all internet users are social media users. As a result, users tend to express their emotions and thoughts through text on social media platforms. Advances in Artificial Intelligence, particularly Natural Language Processing (NLP), have made it possible to interpret text and the underlying emotion within it, a task known as Sentiment Analysis. With a suitable Sentiment Analysis model, detecting depressive text can help identify potential depression and raise awareness of mental health. This paper compares existing tools, namely Long Short-Term Memory (LSTM) and Bidirectional Encoder Representations from Transformers (BERT), for depression detection and examines the shortfalls of these models. Furthermore, using existing models as a base and modifying them, this paper aims to produce a better-performing model. The scope is limited to binary classification of depressive text rather than a wider range of classes, in the hope of producing models that are more accurate and robust on this binary task. This paper proposes the use of pre-trained BERT to outperform existing Sentiment Analysis models, and further proposes a novel modification to the BERT classifier that may improve performance. This paper examines the performance of these models with a wide range of metrics and aims to produce a model that can accurately detect underlying depression in textual data. Bachelor's degree 2024-04-22T06:05:43Z 2024-04-22T06:05:43Z 2024 Final Year Project (FYP) Chan, I. J. J. (2024). Text data mining approach for mental health prediction. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175167 https://hdl.handle.net/10356/175167 en SCSE23-0695 application/pdf Nanyang Technological University
spellingShingle Computer and Information Science
Engineering
Sentiment analysis
Mental health prediction
BERT
Chan, Ian Jia Jun
Text data mining approach for mental health prediction
title Text data mining approach for mental health prediction
title_full Text data mining approach for mental health prediction
title_fullStr Text data mining approach for mental health prediction
title_full_unstemmed Text data mining approach for mental health prediction
title_short Text data mining approach for mental health prediction
title_sort text data mining approach for mental health prediction
topic Computer and Information Science
Engineering
Sentiment analysis
Mental health prediction
BERT
url https://hdl.handle.net/10356/175167
work_keys_str_mv AT chanianjiajun textdataminingapproachformentalhealthprediction