Uncovering the Risks and Drawbacks Associated With the Use of Synthetic Data for Grammatical Error Correction

In a Data-Centric AI paradigm, the model performance is enhanced without altering the model architecture, as evidenced by real-world and benchmark dataset demonstrations. With the advancements of large language models (LLM), it has become increasingly feasible to generate high-quality synthetic data...

Full description

Bibliographic Details
Main Authors:	Seonmin Koo, Chanjun Park, Seolhwa Lee, Jaehyung Seo, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim
Format:	Article
Language:	English
Published:	IEEE 2023-01-01
Series:	IEEE Access
Subjects:	Korean grammatical error correction synthetic data noise injection balanced data
Online Access:	https://ieeexplore.ieee.org/document/10234394/

Internet

https://ieeexplore.ieee.org/document/10234394/

Uncovering the Risks and Drawbacks Associated With the Use of Synthetic Data for Grammatical Error Correction

Internet

Similar Items