Chinese text automatic proofreading model based on Seq2Seq and Bi-LSTM

A new deep learning model based on Seq2Seq and Bi-LSTM is proposed for Chinese text automatic proofreading. Different from the traditional rule-based and probabilistic statistical methods, a Chinese text automatic proofreading model is implemented by adding Bi-LSTM unit and attention mechanism based...

Full description

Bibliographic Details
Main Authors: Gong Yonggang, Wu Meng, Lian Xiaoqin, Pei Chenchen
Format: Article
Language:zho
Published: National Computer System Engineering Research Institute of China 2020-03-01
Series:Dianzi Jishu Yingyong
Subjects:
Online Access:http://www.chinaaet.com/article/3000116336
_version_ 1818513208223203328
author Gong Yonggang
Wu Meng
Lian Xiaoqin
Pei Chenchen
author_facet Gong Yonggang
Wu Meng
Lian Xiaoqin
Pei Chenchen
author_sort Gong Yonggang
collection DOAJ
description A new deep learning model based on Seq2Seq and Bi-LSTM is proposed for Chinese text automatic proofreading. Different from the traditional rule-based and probabilistic statistical methods, a Chinese text automatic proofreading model is implemented by adding Bi-LSTM unit and attention mechanism based on Seq2Seq infrastructure improvement. Comparative experiments of different models were carried out through the open data sets. Experimental results show that the new model can effectively deal with long-distance text errors and semantic errors. The addition of Bi-RNN and attention mechanism can improve the performance of Chinese text proofreading model.
first_indexed 2024-12-10T23:58:03Z
format Article
id doaj.art-fb9acbea03544e198dd4aa8d6e47b996
institution Directory Open Access Journal
issn 0258-7998
language zho
last_indexed 2024-12-10T23:58:03Z
publishDate 2020-03-01
publisher National Computer System Engineering Research Institute of China
record_format Article
series Dianzi Jishu Yingyong
spelling doaj.art-fb9acbea03544e198dd4aa8d6e47b9962022-12-22T01:28:32ZzhoNational Computer System Engineering Research Institute of ChinaDianzi Jishu Yingyong0258-79982020-03-01463424610.16157/j.issn.0258-7998.1902213000116336Chinese text automatic proofreading model based on Seq2Seq and Bi-LSTMGong Yonggang0Wu Meng1Lian Xiaoqin2Pei Chenchen3Beijing Key Laboratory of Food Safety Big Data Technology,College of Computer and Information Engineering, Beijing Technology and Business University,Beijing 100048,ChinaBeijing Key Laboratory of Food Safety Big Data Technology,College of Computer and Information Engineering, Beijing Technology and Business University,Beijing 100048,ChinaBeijing Key Laboratory of Food Safety Big Data Technology,College of Computer and Information Engineering, Beijing Technology and Business University,Beijing 100048,ChinaBeijing Key Laboratory of Food Safety Big Data Technology,College of Computer and Information Engineering, Beijing Technology and Business University,Beijing 100048,ChinaA new deep learning model based on Seq2Seq and Bi-LSTM is proposed for Chinese text automatic proofreading. Different from the traditional rule-based and probabilistic statistical methods, a Chinese text automatic proofreading model is implemented by adding Bi-LSTM unit and attention mechanism based on Seq2Seq infrastructure improvement. Comparative experiments of different models were carried out through the open data sets. Experimental results show that the new model can effectively deal with long-distance text errors and semantic errors. The addition of Bi-RNN and attention mechanism can improve the performance of Chinese text proofreading model.http://www.chinaaet.com/article/3000116336chinese text proofreadingrecurrent neural networkseq2seqnatural language proceessing
spellingShingle Gong Yonggang
Wu Meng
Lian Xiaoqin
Pei Chenchen
Chinese text automatic proofreading model based on Seq2Seq and Bi-LSTM
Dianzi Jishu Yingyong
chinese text proofreading
recurrent neural network
seq2seq
natural language proceessing
title Chinese text automatic proofreading model based on Seq2Seq and Bi-LSTM
title_full Chinese text automatic proofreading model based on Seq2Seq and Bi-LSTM
title_fullStr Chinese text automatic proofreading model based on Seq2Seq and Bi-LSTM
title_full_unstemmed Chinese text automatic proofreading model based on Seq2Seq and Bi-LSTM
title_short Chinese text automatic proofreading model based on Seq2Seq and Bi-LSTM
title_sort chinese text automatic proofreading model based on seq2seq and bi lstm
topic chinese text proofreading
recurrent neural network
seq2seq
natural language proceessing
url http://www.chinaaet.com/article/3000116336
work_keys_str_mv AT gongyonggang chinesetextautomaticproofreadingmodelbasedonseq2seqandbilstm
AT wumeng chinesetextautomaticproofreadingmodelbasedonseq2seqandbilstm
AT lianxiaoqin chinesetextautomaticproofreadingmodelbasedonseq2seqandbilstm
AT peichenchen chinesetextautomaticproofreadingmodelbasedonseq2seqandbilstm