Deep Neural Architecture for Recovering Dropped Pronouns in Korean

Pronouns are frequently dropped in Korean sentences, especially in text messages in the mobile phone environment. Restoring dropped pronouns can be a beneficial preprocessing task for machine translation, information extraction, spoken dialog systems, and many other applications. In this work, we ad...

Full description

Bibliographic Details
Main Authors: Sangkeun Jung, Changki Lee
Format: Article
Language:English
Published: Electronics and Telecommunications Research Institute (ETRI) 2018-04-01
Series:ETRI Journal
Subjects:
Online Access:https://doi.org/10.4218/etrij.2017-0085
_version_ 1828826148053712896
author Sangkeun Jung
Changki Lee
author_facet Sangkeun Jung
Changki Lee
author_sort Sangkeun Jung
collection DOAJ
description Pronouns are frequently dropped in Korean sentences, especially in text messages in the mobile phone environment. Restoring dropped pronouns can be a beneficial preprocessing task for machine translation, information extraction, spoken dialog systems, and many other applications. In this work, we address the problem of dropped pronoun recovery by resolving two simultaneous subtasks: detecting zero‐pronoun sentences and determining the type of dropped pronouns. The problems are statistically modeled by encoding the sentence and classifying types of dropped pronouns using a recurrent neural network (RNN) architecture. Various RNN‐based encoding architectures were investigated, and the stacked RNN was shown to be the best model for Korean zero‐pronoun recovery. The proposed method does not require any manual features to be implemented; nevertheless, it shows good performance.
first_indexed 2024-12-12T14:32:50Z
format Article
id doaj.art-5887001e0fa04630be143f0b0167e350
institution Directory Open Access Journal
issn 1225-6463
2233-7326
language English
last_indexed 2024-12-12T14:32:50Z
publishDate 2018-04-01
publisher Electronics and Telecommunications Research Institute (ETRI)
record_format Article
series ETRI Journal
spelling doaj.art-5887001e0fa04630be143f0b0167e3502022-12-22T00:21:27ZengElectronics and Telecommunications Research Institute (ETRI)ETRI Journal1225-64632233-73262018-04-0140225726510.4218/etrij.2017-008510.4218/etrij.2017-0085Deep Neural Architecture for Recovering Dropped Pronouns in KoreanSangkeun JungChangki LeePronouns are frequently dropped in Korean sentences, especially in text messages in the mobile phone environment. Restoring dropped pronouns can be a beneficial preprocessing task for machine translation, information extraction, spoken dialog systems, and many other applications. In this work, we address the problem of dropped pronoun recovery by resolving two simultaneous subtasks: detecting zero‐pronoun sentences and determining the type of dropped pronouns. The problems are statistically modeled by encoding the sentence and classifying types of dropped pronouns using a recurrent neural network (RNN) architecture. Various RNN‐based encoding architectures were investigated, and the stacked RNN was shown to be the best model for Korean zero‐pronoun recovery. The proposed method does not require any manual features to be implemented; nevertheless, it shows good performance.https://doi.org/10.4218/etrij.2017-0085Deep learningDropped pronoun recoveryLSTM EncodingZero pronoun
spellingShingle Sangkeun Jung
Changki Lee
Deep Neural Architecture for Recovering Dropped Pronouns in Korean
ETRI Journal
Deep learning
Dropped pronoun recovery
LSTM Encoding
Zero pronoun
title Deep Neural Architecture for Recovering Dropped Pronouns in Korean
title_full Deep Neural Architecture for Recovering Dropped Pronouns in Korean
title_fullStr Deep Neural Architecture for Recovering Dropped Pronouns in Korean
title_full_unstemmed Deep Neural Architecture for Recovering Dropped Pronouns in Korean
title_short Deep Neural Architecture for Recovering Dropped Pronouns in Korean
title_sort deep neural architecture for recovering dropped pronouns in korean
topic Deep learning
Dropped pronoun recovery
LSTM Encoding
Zero pronoun
url https://doi.org/10.4218/etrij.2017-0085
work_keys_str_mv AT sangkeunjung deepneuralarchitectureforrecoveringdroppedpronounsinkorean
AT changkilee deepneuralarchitectureforrecoveringdroppedpronounsinkorean