Data governance and Gensini score automatic calculation for coronary angiography with deep-learning-based natural language extraction

With the widespread adoption of electronic health records, the amount of stored medical data has been increasing. Clinical data, often in the form of semi-structured or unstructured electronic medical records (EMRs), contains rich patient information. However, due to the use of natural language by p...

Full description

Bibliographic Details
Main Authors: Feng Li, Mingfeng Jiang, Hongzeng Xu, Yi Chen, Feng Chen, Wei Nie, Li Wang
Format: Article
Language:English
Published: AIMS Press 2024-02-01
Series:Mathematical Biosciences and Engineering
Subjects:
Online Access:https://www.aimspress.com/article/doi/10.3934/mbe.2024180?viewType=HTML
_version_ 1797263736340217856
author Feng Li
Mingfeng Jiang
Hongzeng Xu
Yi Chen
Feng Chen
Wei Nie
Li Wang
author_facet Feng Li
Mingfeng Jiang
Hongzeng Xu
Yi Chen
Feng Chen
Wei Nie
Li Wang
author_sort Feng Li
collection DOAJ
description With the widespread adoption of electronic health records, the amount of stored medical data has been increasing. Clinical data, often in the form of semi-structured or unstructured electronic medical records (EMRs), contains rich patient information. However, due to the use of natural language by physicians when composing these records, the effectiveness of traditional methods such as dictionaries, rule matching, and machine learning in the extraction of information from these unstructured texts falls short of clinical standards. In this paper, a novel deep-learning-based natural language extraction method is proposed to overcome current shortcomings in data governance and Gensini score automatic calculation in coronary angiography. A pre-trained model called bidirectional encoder representation from transformers (BERT) with strong text feature representation capabilities is employed as the feature representation layer. It is combined with bidirectional long short-term memory (BiLSTM) and conditional random field (CRF) models to extract both global and local features from the text. The study included an evaluation of the model on a dataset from a hospital in China and it was compared with another model to validate its practical advantages. Hence, the BiLSTM-CRF model was employed to automatically extract relevant coronary angiogram information from EMR texts. The achieved F1 score was 91.19, which is approximately 0.87 higher than the BERT-BiLSTM-CRF model.
first_indexed 2024-04-25T00:17:45Z
format Article
id doaj.art-6953ab67b0d44c679080de22d42cab3e
institution Directory Open Access Journal
issn 1551-0018
language English
last_indexed 2024-04-25T00:17:45Z
publishDate 2024-02-01
publisher AIMS Press
record_format Article
series Mathematical Biosciences and Engineering
spelling doaj.art-6953ab67b0d44c679080de22d42cab3e2024-03-13T01:23:23ZengAIMS PressMathematical Biosciences and Engineering1551-00182024-02-012134085410310.3934/mbe.2024180Data governance and Gensini score automatic calculation for coronary angiography with deep-learning-based natural language extractionFeng Li0Mingfeng Jiang1Hongzeng Xu2Yi Chen 3Feng Chen4Wei Nie5Li Wang61. School of Information and Electronic Engineering, Zhejiang Gongshang University, Hangzhou 310018, China1. School of Information and Electronic Engineering, Zhejiang Gongshang University, Hangzhou 310018, China2. Department of Cardiology, The People's Hospital of Liaoning Province, Liaoning, Shenyang 110011, China1. School of Information and Electronic Engineering, Zhejiang Gongshang University, Hangzhou 310018, China1. School of Information and Electronic Engineering, Zhejiang Gongshang University, Hangzhou 310018, China1. School of Information and Electronic Engineering, Zhejiang Gongshang University, Hangzhou 310018, China3. College of Marine Electrical Engineering, Dalian Maritime University, Dalian 116026, ChinaWith the widespread adoption of electronic health records, the amount of stored medical data has been increasing. Clinical data, often in the form of semi-structured or unstructured electronic medical records (EMRs), contains rich patient information. However, due to the use of natural language by physicians when composing these records, the effectiveness of traditional methods such as dictionaries, rule matching, and machine learning in the extraction of information from these unstructured texts falls short of clinical standards. In this paper, a novel deep-learning-based natural language extraction method is proposed to overcome current shortcomings in data governance and Gensini score automatic calculation in coronary angiography. A pre-trained model called bidirectional encoder representation from transformers (BERT) with strong text feature representation capabilities is employed as the feature representation layer. It is combined with bidirectional long short-term memory (BiLSTM) and conditional random field (CRF) models to extract both global and local features from the text. The study included an evaluation of the model on a dataset from a hospital in China and it was compared with another model to validate its practical advantages. Hence, the BiLSTM-CRF model was employed to automatically extract relevant coronary angiogram information from EMR texts. The achieved F1 score was 91.19, which is approximately 0.87 higher than the BERT-BiLSTM-CRF model.https://www.aimspress.com/article/doi/10.3934/mbe.2024180?viewType=HTMLelectronic health recordscoronary angiographygensini scoredeep learning
spellingShingle Feng Li
Mingfeng Jiang
Hongzeng Xu
Yi Chen
Feng Chen
Wei Nie
Li Wang
Data governance and Gensini score automatic calculation for coronary angiography with deep-learning-based natural language extraction
Mathematical Biosciences and Engineering
electronic health records
coronary angiography
gensini score
deep learning
title Data governance and Gensini score automatic calculation for coronary angiography with deep-learning-based natural language extraction
title_full Data governance and Gensini score automatic calculation for coronary angiography with deep-learning-based natural language extraction
title_fullStr Data governance and Gensini score automatic calculation for coronary angiography with deep-learning-based natural language extraction
title_full_unstemmed Data governance and Gensini score automatic calculation for coronary angiography with deep-learning-based natural language extraction
title_short Data governance and Gensini score automatic calculation for coronary angiography with deep-learning-based natural language extraction
title_sort data governance and gensini score automatic calculation for coronary angiography with deep learning based natural language extraction
topic electronic health records
coronary angiography
gensini score
deep learning
url https://www.aimspress.com/article/doi/10.3934/mbe.2024180?viewType=HTML
work_keys_str_mv AT fengli datagovernanceandgensiniscoreautomaticcalculationforcoronaryangiographywithdeeplearningbasednaturallanguageextraction
AT mingfengjiang datagovernanceandgensiniscoreautomaticcalculationforcoronaryangiographywithdeeplearningbasednaturallanguageextraction
AT hongzengxu datagovernanceandgensiniscoreautomaticcalculationforcoronaryangiographywithdeeplearningbasednaturallanguageextraction
AT yichen datagovernanceandgensiniscoreautomaticcalculationforcoronaryangiographywithdeeplearningbasednaturallanguageextraction
AT fengchen datagovernanceandgensiniscoreautomaticcalculationforcoronaryangiographywithdeeplearningbasednaturallanguageextraction
AT weinie datagovernanceandgensiniscoreautomaticcalculationforcoronaryangiographywithdeeplearningbasednaturallanguageextraction
AT liwang datagovernanceandgensiniscoreautomaticcalculationforcoronaryangiographywithdeeplearningbasednaturallanguageextraction