Chinese Grammatical Error Correction Based on Convolutional Sequence to Sequence Model

Chinese grammatical error correction (CGEC) is practically useful for learners of Chinese as a second language, but it is a rather challenging task due to the complex and flexible nature of Chinese language so that existing methods for English cannot be directly applied. In this paper, we introduce...

Full description

Bibliographic Details
Main Authors: Si Li, Jianbo Zhao, Guirong Shi, Yuanpeng Tan, Huifang Xu, Guang Chen, Haibo Lan, Zhiqing Lin
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8717692/
_version_ 1818621623429758976
author Si Li
Jianbo Zhao
Guirong Shi
Yuanpeng Tan
Huifang Xu
Guang Chen
Haibo Lan
Zhiqing Lin
author_facet Si Li
Jianbo Zhao
Guirong Shi
Yuanpeng Tan
Huifang Xu
Guang Chen
Haibo Lan
Zhiqing Lin
author_sort Si Li
collection DOAJ
description Chinese grammatical error correction (CGEC) is practically useful for learners of Chinese as a second language, but it is a rather challenging task due to the complex and flexible nature of Chinese language so that existing methods for English cannot be directly applied. In this paper, we introduce a convolutional sequence to sequence model into the CGEC task for the first time, since many Chinese grammatical errors are concentrated between three and four words and convolutional neural network can better capture the local context. A convolution-based model can obtain the representations of the context by fixed size kernel. By stacking convolution layers, long-term dependences can be obtained. We also propose two optimization methods, shared embedding and policy gradient, to optimize the convolutional sequence to sequence model through sharing parameters and reconstructing loss function. Besides, we collate the existing Chinese grammatical correction corpus in detail. The results show that the models we proposed two different optimization methods both achieve large improvement compared with the natural machine translation model based on a recurrent neural network.
first_indexed 2024-12-16T18:12:13Z
format Article
id doaj.art-fb098062980849e5a89824b3694cfe1f
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-16T18:12:13Z
publishDate 2019-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-fb098062980849e5a89824b3694cfe1f2022-12-21T22:21:44ZengIEEEIEEE Access2169-35362019-01-017729057291310.1109/ACCESS.2019.29176318717692Chinese Grammatical Error Correction Based on Convolutional Sequence to Sequence ModelSi Li0Jianbo Zhao1https://orcid.org/0000-0003-3755-4691Guirong Shi2Yuanpeng Tan3Huifang Xu4Guang Chen5Haibo Lan6Zhiqing Lin7School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, ChinaSchool of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, ChinaState Grid Jibei Electric Power Company Limited, Beijing, ChinaChina Electric Power Research Institute, Beijing, ChinaChina Electric Power Research Institute, Beijing, ChinaSchool of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, ChinaState Grid Jibei Electric Power Company Limited, Beijing, ChinaSchool of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, ChinaChinese grammatical error correction (CGEC) is practically useful for learners of Chinese as a second language, but it is a rather challenging task due to the complex and flexible nature of Chinese language so that existing methods for English cannot be directly applied. In this paper, we introduce a convolutional sequence to sequence model into the CGEC task for the first time, since many Chinese grammatical errors are concentrated between three and four words and convolutional neural network can better capture the local context. A convolution-based model can obtain the representations of the context by fixed size kernel. By stacking convolution layers, long-term dependences can be obtained. We also propose two optimization methods, shared embedding and policy gradient, to optimize the convolutional sequence to sequence model through sharing parameters and reconstructing loss function. Besides, we collate the existing Chinese grammatical correction corpus in detail. The results show that the models we proposed two different optimization methods both achieve large improvement compared with the natural machine translation model based on a recurrent neural network.https://ieeexplore.ieee.org/document/8717692/Chinese grammatical error correctionsequence to sequenceconvolutional
spellingShingle Si Li
Jianbo Zhao
Guirong Shi
Yuanpeng Tan
Huifang Xu
Guang Chen
Haibo Lan
Zhiqing Lin
Chinese Grammatical Error Correction Based on Convolutional Sequence to Sequence Model
IEEE Access
Chinese grammatical error correction
sequence to sequence
convolutional
title Chinese Grammatical Error Correction Based on Convolutional Sequence to Sequence Model
title_full Chinese Grammatical Error Correction Based on Convolutional Sequence to Sequence Model
title_fullStr Chinese Grammatical Error Correction Based on Convolutional Sequence to Sequence Model
title_full_unstemmed Chinese Grammatical Error Correction Based on Convolutional Sequence to Sequence Model
title_short Chinese Grammatical Error Correction Based on Convolutional Sequence to Sequence Model
title_sort chinese grammatical error correction based on convolutional sequence to sequence model
topic Chinese grammatical error correction
sequence to sequence
convolutional
url https://ieeexplore.ieee.org/document/8717692/
work_keys_str_mv AT sili chinesegrammaticalerrorcorrectionbasedonconvolutionalsequencetosequencemodel
AT jianbozhao chinesegrammaticalerrorcorrectionbasedonconvolutionalsequencetosequencemodel
AT guirongshi chinesegrammaticalerrorcorrectionbasedonconvolutionalsequencetosequencemodel
AT yuanpengtan chinesegrammaticalerrorcorrectionbasedonconvolutionalsequencetosequencemodel
AT huifangxu chinesegrammaticalerrorcorrectionbasedonconvolutionalsequencetosequencemodel
AT guangchen chinesegrammaticalerrorcorrectionbasedonconvolutionalsequencetosequencemodel
AT haibolan chinesegrammaticalerrorcorrectionbasedonconvolutionalsequencetosequencemodel
AT zhiqinglin chinesegrammaticalerrorcorrectionbasedonconvolutionalsequencetosequencemodel