A Korean emotion-factor dataset for extracting emotion and factors in Korean conversations

Abstract Humans express their emotions in various ways, such as through facial expressions and voices. In particular, emotions are directly expressed or indirectly implied in the text of utterance. Research on the technology to identify emotions included in human speech and generate utterances is be...

Full description

Bibliographic Details
Main Authors:	SoYeop Yoo, HaYoung Lee, JeIn Song, OkRan Jeong
Format:	Article
Language:	English
Published:	Nature Portfolio 2023-10-01
Series:	Scientific Reports
Online Access:	https://doi.org/10.1038/s41598-023-45386-8

_version_	1797647324861693952
author	SoYeop Yoo HaYoung Lee JeIn Song OkRan Jeong
author_facet	SoYeop Yoo HaYoung Lee JeIn Song OkRan Jeong
author_sort	SoYeop Yoo
collection	DOAJ
description	Abstract Humans express their emotions in various ways, such as through facial expressions and voices. In particular, emotions are directly expressed or indirectly implied in the text of utterance. Research on the technology to identify emotions included in human speech and generate utterances is being conducted in conversational artificial intelligence technology. Despite the importance of recognizing the factors of previously generated emotions to generate emotion-based utterances, most of the existing datasets only provide the classification of emotions in text and utterances. In addition, in the case of Korean datasets, the classification of emotions is not diverse, and it is mainly biased toward negative emotion classification. In this paper, we propose KEmoFact, a Korean emotion-factor dataset for extracting emotion and factors in Korean conversations. We also define two tasks for the KEmoFact dataset, EFE (Emotion Factor Extraction) and EFPE (Emotion-Factor Pair Extraction), and propose baseline models for the tasks. We contribute to the study of conversational artificial intelligence, especially in Korean, one of the low-resource languages, by proposing the KEmoFact dataset and suggesting baseline models for two tasks.
first_indexed	2024-03-11T15:14:38Z
format	Article
id	doaj.art-4bc3144383c54c6d95dcf35dbdb76430
institution	Directory Open Access Journal
issn	2045-2322
language	English
last_indexed	2024-03-11T15:14:38Z
publishDate	2023-10-01
publisher	Nature Portfolio
record_format	Article
series	Scientific Reports
spelling	doaj.art-4bc3144383c54c6d95dcf35dbdb764302023-10-29T12:22:13ZengNature PortfolioScientific Reports2045-23222023-10-0113111210.1038/s41598-023-45386-8A Korean emotion-factor dataset for extracting emotion and factors in Korean conversationsSoYeop Yoo0HaYoung Lee1JeIn Song2OkRan Jeong3School of Computing, Gachon UniversitySchool of Computing, Gachon UniversityScatter LabSchool of Computing, Gachon UniversityAbstract Humans express their emotions in various ways, such as through facial expressions and voices. In particular, emotions are directly expressed or indirectly implied in the text of utterance. Research on the technology to identify emotions included in human speech and generate utterances is being conducted in conversational artificial intelligence technology. Despite the importance of recognizing the factors of previously generated emotions to generate emotion-based utterances, most of the existing datasets only provide the classification of emotions in text and utterances. In addition, in the case of Korean datasets, the classification of emotions is not diverse, and it is mainly biased toward negative emotion classification. In this paper, we propose KEmoFact, a Korean emotion-factor dataset for extracting emotion and factors in Korean conversations. We also define two tasks for the KEmoFact dataset, EFE (Emotion Factor Extraction) and EFPE (Emotion-Factor Pair Extraction), and propose baseline models for the tasks. We contribute to the study of conversational artificial intelligence, especially in Korean, one of the low-resource languages, by proposing the KEmoFact dataset and suggesting baseline models for two tasks.https://doi.org/10.1038/s41598-023-45386-8
spellingShingle	SoYeop Yoo HaYoung Lee JeIn Song OkRan Jeong A Korean emotion-factor dataset for extracting emotion and factors in Korean conversations Scientific Reports
title	A Korean emotion-factor dataset for extracting emotion and factors in Korean conversations
title_full	A Korean emotion-factor dataset for extracting emotion and factors in Korean conversations
title_fullStr	A Korean emotion-factor dataset for extracting emotion and factors in Korean conversations
title_full_unstemmed	A Korean emotion-factor dataset for extracting emotion and factors in Korean conversations
title_short	A Korean emotion-factor dataset for extracting emotion and factors in Korean conversations
title_sort	korean emotion factor dataset for extracting emotion and factors in korean conversations
url	https://doi.org/10.1038/s41598-023-45386-8
work_keys_str_mv	AT soyeopyoo akoreanemotionfactordatasetforextractingemotionandfactorsinkoreanconversations AT hayounglee akoreanemotionfactordatasetforextractingemotionandfactorsinkoreanconversations AT jeinsong akoreanemotionfactordatasetforextractingemotionandfactorsinkoreanconversations AT okranjeong akoreanemotionfactordatasetforextractingemotionandfactorsinkoreanconversations AT soyeopyoo koreanemotionfactordatasetforextractingemotionandfactorsinkoreanconversations AT hayounglee koreanemotionfactordatasetforextractingemotionandfactorsinkoreanconversations AT jeinsong koreanemotionfactordatasetforextractingemotionandfactorsinkoreanconversations AT okranjeong koreanemotionfactordatasetforextractingemotionandfactorsinkoreanconversations

A Korean emotion-factor dataset for extracting emotion and factors in Korean conversations

Similar Items