Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media Data

As a serious worldwide problem, suicide often causes huge and irreversible losses to families and society. Therefore, it is necessary to detect and help individuals with suicidal ideation in time. In recent years, the prosperous development of social media has provided new perspectives on suicide de...

Full description

Bibliographic Details
Main Authors:	Zepeng Li, Jiawei Zhou, Zhengyi An, Wenchuan Cheng, Bin Hu
Format:	Article
Language:	English
Published:	MDPI AG 2022-03-01
Series:	Entropy
Subjects:	social media suicide ideation detection deep neural network imbalanced data Sina Weibo China
Online Access:	https://www.mdpi.com/1099-4300/24/4/442

_version_	1827628911611084800
author	Zepeng Li Jiawei Zhou Zhengyi An Wenchuan Cheng Bin Hu
author_facet	Zepeng Li Jiawei Zhou Zhengyi An Wenchuan Cheng Bin Hu
author_sort	Zepeng Li
collection	DOAJ
description	As a serious worldwide problem, suicide often causes huge and irreversible losses to families and society. Therefore, it is necessary to detect and help individuals with suicidal ideation in time. In recent years, the prosperous development of social media has provided new perspectives on suicide detection, but related research still faces some difficulties, such as data imbalance and expression implicitness. In this paper, we propose a Deep Hierarchical Ensemble model for Suicide Detection (DHE-SD) based on a hierarchical ensemble strategy, and construct a dataset based on Sina Weibo, which contains more than 550 thousand posts from 4521 users. To verify the effectiveness of the model, we also conduct experiments on a public Weibo dataset containing 7329 users’ posts. The proposed model achieves the best performance on both the constructed dataset and the public dataset. In addition, in order to make the model applicable to a wider population, we use the proposed sentence-level mask mechanism to delete user posts with strong suicidal ideation. Experiments show that the proposed model can still effectively identify social media users with suicidal ideation even when the performance of the baseline models decrease significantly.
first_indexed	2024-03-09T13:42:19Z
format	Article
id	doaj.art-714bf628e60a48a28b93540f2f3a52de
institution	Directory Open Access Journal
issn	1099-4300
language	English
last_indexed	2024-03-09T13:42:19Z
publishDate	2022-03-01
publisher	MDPI AG
record_format	Article
series	Entropy
spelling	doaj.art-714bf628e60a48a28b93540f2f3a52de2023-11-30T21:04:53ZengMDPI AGEntropy1099-43002022-03-0124444210.3390/e24040442Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media DataZepeng Li0Jiawei Zhou1Zhengyi An2Wenchuan Cheng3Bin Hu4School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, ChinaSchool of Information Science and Engineering, Lanzhou University, Lanzhou 730000, ChinaSchool of Information Science and Engineering, Lanzhou University, Lanzhou 730000, ChinaSchool of Information Science and Engineering, Lanzhou University, Lanzhou 730000, ChinaSchool of Information Science and Engineering, Lanzhou University, Lanzhou 730000, ChinaAs a serious worldwide problem, suicide often causes huge and irreversible losses to families and society. Therefore, it is necessary to detect and help individuals with suicidal ideation in time. In recent years, the prosperous development of social media has provided new perspectives on suicide detection, but related research still faces some difficulties, such as data imbalance and expression implicitness. In this paper, we propose a Deep Hierarchical Ensemble model for Suicide Detection (DHE-SD) based on a hierarchical ensemble strategy, and construct a dataset based on Sina Weibo, which contains more than 550 thousand posts from 4521 users. To verify the effectiveness of the model, we also conduct experiments on a public Weibo dataset containing 7329 users’ posts. The proposed model achieves the best performance on both the constructed dataset and the public dataset. In addition, in order to make the model applicable to a wider population, we use the proposed sentence-level mask mechanism to delete user posts with strong suicidal ideation. Experiments show that the proposed model can still effectively identify social media users with suicidal ideation even when the performance of the baseline models decrease significantly.https://www.mdpi.com/1099-4300/24/4/442social mediasuicide ideation detectiondeep neural networkimbalanced dataSina WeiboChina
spellingShingle	Zepeng Li Jiawei Zhou Zhengyi An Wenchuan Cheng Bin Hu Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media Data Entropy social media suicide ideation detection deep neural network imbalanced data Sina Weibo China
title	Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media Data
title_full	Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media Data
title_fullStr	Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media Data
title_full_unstemmed	Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media Data
title_short	Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media Data
title_sort	deep hierarchical ensemble model for suicide detection on imbalanced social media data
topic	social media suicide ideation detection deep neural network imbalanced data Sina Weibo China
url	https://www.mdpi.com/1099-4300/24/4/442
work_keys_str_mv	AT zepengli deephierarchicalensemblemodelforsuicidedetectiononimbalancedsocialmediadata AT jiaweizhou deephierarchicalensemblemodelforsuicidedetectiononimbalancedsocialmediadata AT zhengyian deephierarchicalensemblemodelforsuicidedetectiononimbalancedsocialmediadata AT wenchuancheng deephierarchicalensemblemodelforsuicidedetectiononimbalancedsocialmediadata AT binhu deephierarchicalensemblemodelforsuicidedetectiononimbalancedsocialmediadata

Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media Data

Similar Items