Organizing an in-class hackathon to correct PDF-to-text conversion errors of 1.0
This paper describes a community effort to improve earlier versions of the full-text corpus of Genomics & Informatics by semi-automatically detecting and correcting PDF-to-text conversion errors and optical character recognition errors during the first hackathon of Genomics & Informatics Ann...
Main Authors: | Sunho Kim, Royoung Kim, Hee-Jo Nam, Ryeo-Gyeong Kim, Enjin Ko, Han-Su Kim, Jihye Shin, Daeun Cho, Yurhee Jin, Soyeon Bae, Ye Won Jo, San Ah Jeong, Yena Kim, Seoyeon Ahn, Bomi Jang, Jiheyon Seong, Yujin Lee, Si Eun Seo, Yujin Kim, Ha-Jeong Kim, Hyeji Kim, Hye-Lynn Sung, Hyoyoung Lho, Jaywon Koo, Jion Chu, Juwon Lim, Youngju Kim, Kyungyeon Lee, Yuri Lim, Meongeun Kim, Seonjeong Hwang, Shinhye Han, Sohyeun Bae, Sua Kim, Suhyeon Yoo, Yeonjeong Seo, Yerim Shin, Yonsoo Kim, You-Jung Ko, Jihee Baek, Hyejin Hyun, Hyemin Choi, Ji-Hye Oh, Da-Young Kim, Hyun-Seok Park |
---|---|
Format: | Article |
Language: | English |
Published: |
Korea Genome Organization
2020-09-01
|
Series: | Genomics & Informatics |
Subjects: | |
Online Access: | http://genominfo.org/upload/pdf/gi-2020-18-3-e33.pdf |
Similar Items
-
Study of Analyzing Outcome of Building and Introducing System for Preserving Full-Text of e-Journal
by: Kwang-Young Kim, et al.
Published: (2012-12-01) -
A Topical Category-Aware Neural Text Summarizer
by: So-Eon Kim, et al.
Published: (2020-08-01) -
Extending TextAE for annotation of non-contiguous entities
by: Jake Lever, et al.
Published: (2020-06-01) -
Evaluation and Analysis of Large Language Models for Clinical Text Augmentation and Generation
by: Atif Latif, et al.
Published: (2024-01-01) -
Automatic Linkage Model of Classification Systems Based on a Pretraining Language Model for Interconnecting Science and Technology with Job Information
by: Hyun Ji Jeong, et al.
Published: (2022-06-01)