A Benchmark Dataset to Distinguish Human-Written and Machine-Generated Scientific Papers
As generative NLP can now produce content nearly indistinguishable from human writing, it is becoming difficult to identify genuine research contributions in academic writing and scientific publications. Moreover, information in machine-generated text can be factually wrong or even entirely fabricat...
Main Authors: | Mohamed Hesham Ibrahim Abdalla, Simon Malberg, Daryna Dementieva, Edoardo Mosca, Georg Groh |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-09-01
|
Series: | Information |
Subjects: | |
Online Access: | https://www.mdpi.com/2078-2489/14/10/522 |
Similar Items
-
Beyond Lexical Boundaries: LLM-Generated Text Detection for Romanian Digital Libraries
by: Melania Nitu, et al.
Published: (2024-01-01) -
Survey of Deep Learning Table-to-Text Generation
by: HU Kang, XI Xuefeng, CUI Zhiming, ZHOU Yueyao, QIU Yajin
Published: (2022-11-01) -
Recent Advances in Text-to-Image Synthesis: Approaches, Datasets and Future Research Prospects
by: Yong Xuan Tan, et al.
Published: (2023-01-01) -
Application of Generative Adversarial Networks and Shapley Algorithm Based on Easy Data Augmentation for Imbalanced Text Data
by: Jheng-Long Wu, et al.
Published: (2022-10-01) -
Content Order-Controllable MR-to-Text
by: Keisuke Toyama, et al.
Published: (2023-01-01)