Hierarchical shared transfer learning for biomedical named entity recognition
Abstract Background Biomedical named entity recognition (BioNER) is a basic and important medical information extraction task to extract medical entities with special meaning from medical texts. In recent years, deep learning has become the main research direction of BioNER due to its excellent data...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2022-01-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12859-021-04551-4 |
_version_ | 1819229332513488896 |
---|---|
author | Zhaoying Chai Han Jin Shenghui Shi Siyan Zhan Lin Zhuo Yu Yang |
author_facet | Zhaoying Chai Han Jin Shenghui Shi Siyan Zhan Lin Zhuo Yu Yang |
author_sort | Zhaoying Chai |
collection | DOAJ |
description | Abstract Background Biomedical named entity recognition (BioNER) is a basic and important medical information extraction task to extract medical entities with special meaning from medical texts. In recent years, deep learning has become the main research direction of BioNER due to its excellent data-driven context coding ability. However, in BioNER task, deep learning has the problem of poor generalization and instability. Results we propose the hierarchical shared transfer learning, which combines multi-task learning and fine-tuning, and realizes the multi-level information fusion between the underlying entity features and the upper data features. We select 14 datasets containing 4 types of entities for training and evaluate the model. The experimental results showed that the F1-scores of the five gold standard datasets BC5CDR-chemical, BC5CDR-disease, BC2GM, BC4CHEMD, NCBI-disease and LINNAEUS were increased by 0.57, 0.90, 0.42, 0.77, 0.98 and − 2.16 compared to the single-task XLNet-CRF model. BC5CDR-chemical, BC5CDR-disease and BC4CHEMD achieved state-of-the-art results.The reasons why LINNAEUS’s multi-task results are lower than single-task results are discussed at the dataset level. Conclusion Compared with using multi-task learning and fine-tuning alone, the model has more accurate recognition ability of medical entities, and has higher generalization and stability. |
first_indexed | 2024-12-23T11:11:30Z |
format | Article |
id | doaj.art-5cc9d4ed5292411c940434f79b0cfc42 |
institution | Directory Open Access Journal |
issn | 1471-2105 |
language | English |
last_indexed | 2024-12-23T11:11:30Z |
publishDate | 2022-01-01 |
publisher | BMC |
record_format | Article |
series | BMC Bioinformatics |
spelling | doaj.art-5cc9d4ed5292411c940434f79b0cfc422022-12-21T17:49:20ZengBMCBMC Bioinformatics1471-21052022-01-0123111410.1186/s12859-021-04551-4Hierarchical shared transfer learning for biomedical named entity recognitionZhaoying Chai0Han Jin1Shenghui Shi2Siyan Zhan3Lin Zhuo4Yu Yang5College of Information Science and Technology, Beijing University of Chemical TechnologyCollege of Information Science and Technology, Beijing University of Chemical TechnologyCollege of Information Science and Technology, Beijing University of Chemical TechnologySchool of Public Health, Peking UniversityResearch Center of Clinical Epidemiology, Peking University Third HospitalNational Institute of Health Data Science, Peking UniversityAbstract Background Biomedical named entity recognition (BioNER) is a basic and important medical information extraction task to extract medical entities with special meaning from medical texts. In recent years, deep learning has become the main research direction of BioNER due to its excellent data-driven context coding ability. However, in BioNER task, deep learning has the problem of poor generalization and instability. Results we propose the hierarchical shared transfer learning, which combines multi-task learning and fine-tuning, and realizes the multi-level information fusion between the underlying entity features and the upper data features. We select 14 datasets containing 4 types of entities for training and evaluate the model. The experimental results showed that the F1-scores of the five gold standard datasets BC5CDR-chemical, BC5CDR-disease, BC2GM, BC4CHEMD, NCBI-disease and LINNAEUS were increased by 0.57, 0.90, 0.42, 0.77, 0.98 and − 2.16 compared to the single-task XLNet-CRF model. BC5CDR-chemical, BC5CDR-disease and BC4CHEMD achieved state-of-the-art results.The reasons why LINNAEUS’s multi-task results are lower than single-task results are discussed at the dataset level. Conclusion Compared with using multi-task learning and fine-tuning alone, the model has more accurate recognition ability of medical entities, and has higher generalization and stability.https://doi.org/10.1186/s12859-021-04551-4BioNLPBiomedical named entity recognitionTransfer learningPermutation language modelConditional random field |
spellingShingle | Zhaoying Chai Han Jin Shenghui Shi Siyan Zhan Lin Zhuo Yu Yang Hierarchical shared transfer learning for biomedical named entity recognition BMC Bioinformatics BioNLP Biomedical named entity recognition Transfer learning Permutation language model Conditional random field |
title | Hierarchical shared transfer learning for biomedical named entity recognition |
title_full | Hierarchical shared transfer learning for biomedical named entity recognition |
title_fullStr | Hierarchical shared transfer learning for biomedical named entity recognition |
title_full_unstemmed | Hierarchical shared transfer learning for biomedical named entity recognition |
title_short | Hierarchical shared transfer learning for biomedical named entity recognition |
title_sort | hierarchical shared transfer learning for biomedical named entity recognition |
topic | BioNLP Biomedical named entity recognition Transfer learning Permutation language model Conditional random field |
url | https://doi.org/10.1186/s12859-021-04551-4 |
work_keys_str_mv | AT zhaoyingchai hierarchicalsharedtransferlearningforbiomedicalnamedentityrecognition AT hanjin hierarchicalsharedtransferlearningforbiomedicalnamedentityrecognition AT shenghuishi hierarchicalsharedtransferlearningforbiomedicalnamedentityrecognition AT siyanzhan hierarchicalsharedtransferlearningforbiomedicalnamedentityrecognition AT linzhuo hierarchicalsharedtransferlearningforbiomedicalnamedentityrecognition AT yuyang hierarchicalsharedtransferlearningforbiomedicalnamedentityrecognition |