Europe PMC annotated full-text corpus for gene/proteins, diseases and organisms

Abstract: Named entity recognition (NER) is a widely used text-mining and natural language processing (NLP) subtask. In recent years, deep learning methods have superseded traditional dictionary- and rule-based NER approaches. A high-quality dataset is essential to fully leverage recent deep learning...

Bibliographic Details
Main Authors: Xiao Yang, Shyamasree Saha, Aravind Venkatesan, Santosh Tirunagari, Vid Vartak, Johanna McEntyre
Format: Article
Language: English
Published: Nature Portfolio 2023-10-01
Series: Scientific Data
Online Access: https://doi.org/10.1038/s41597-023-02617-x