Automatic allergy classification based on Russian unstructured medical texts

Most of the medical data in hospital information systems databases are stored in an unstructured form. Techniques for processing unstructured records are widely presented in scientific papers focused on English data. This paper proposes a method for intellectual analysis of unstructured allergy anam...

Full description

Bibliographic Details
Main Authors: Iuliia D. Lenivtceva, Georgy D. Kopanitsa
Format: Article
Language:English
Published: Saint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University) 2021-06-01
Series:Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki
Subjects:
Online Access:https://ntv.ifmo.ru/file/article/20517.pdf
_version_ 1818442574800617472
author Iuliia D. Lenivtceva
Georgy D. Kopanitsa
author_facet Iuliia D. Lenivtceva
Georgy D. Kopanitsa
author_sort Iuliia D. Lenivtceva
collection DOAJ
description Most of the medical data in hospital information systems databases are stored in an unstructured form. Techniques for processing unstructured records are widely presented in scientific papers focused on English data. This paper proposes a method for intellectual analysis of unstructured allergy anamnesis in Russian in order to identify the presence and type of allergy and intolerance of a patient. The method is based on machine learning algorithms and uses international standards for the exchange of medical data and terminology standards, such as FHIR and SNOMED CT. As a result of the experiment, about 12 thousand medical records were processed. F-measure for the developed classification models ranged from 0.93 to 0.96. The models showed high values of metrics for evaluating the effectiveness of the models. In the future, structured data can be used in models for predicting medical risks. Further development of methods for structuring medical texts will ensure the interoperability of medical data.
first_indexed 2024-12-14T18:46:19Z
format Article
id doaj.art-d8fc923971f947ea9cfc7b5401fab1ac
institution Directory Open Access Journal
issn 2226-1494
2500-0373
language English
last_indexed 2024-12-14T18:46:19Z
publishDate 2021-06-01
publisher Saint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University)
record_format Article
series Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki
spelling doaj.art-d8fc923971f947ea9cfc7b5401fab1ac2022-12-21T22:51:22ZengSaint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University)Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki2226-14942500-03732021-06-0121343343610.17586/2226-1494-2021-21-3-433-436Automatic allergy classification based on Russian unstructured medical textsIuliia D. Lenivtceva0https://orcid.org/0000-0002-5572-5151Georgy D. Kopanitsa1https://orcid.org/0000-0002-6231-8036Engineer, ITMO University, Saint Petersburg, 197101, Russian FederationPhD, Leading Researcher, ITMO University, Saint Petersburg, 197101, Russian FederationMost of the medical data in hospital information systems databases are stored in an unstructured form. Techniques for processing unstructured records are widely presented in scientific papers focused on English data. This paper proposes a method for intellectual analysis of unstructured allergy anamnesis in Russian in order to identify the presence and type of allergy and intolerance of a patient. The method is based on machine learning algorithms and uses international standards for the exchange of medical data and terminology standards, such as FHIR and SNOMED CT. As a result of the experiment, about 12 thousand medical records were processed. F-measure for the developed classification models ranged from 0.93 to 0.96. The models showed high values of metrics for evaluating the effectiveness of the models. In the future, structured data can be used in models for predicting medical risks. Further development of methods for structuring medical texts will ensure the interoperability of medical data.https://ntv.ifmo.ru/file/article/20517.pdfmedical data structuringallergyintolerancemachine learningunstructured text analysisinteroperability
spellingShingle Iuliia D. Lenivtceva
Georgy D. Kopanitsa
Automatic allergy classification based on Russian unstructured medical texts
Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki
medical data structuring
allergy
intolerance
machine learning
unstructured text analysis
interoperability
title Automatic allergy classification based on Russian unstructured medical texts
title_full Automatic allergy classification based on Russian unstructured medical texts
title_fullStr Automatic allergy classification based on Russian unstructured medical texts
title_full_unstemmed Automatic allergy classification based on Russian unstructured medical texts
title_short Automatic allergy classification based on Russian unstructured medical texts
title_sort automatic allergy classification based on russian unstructured medical texts
topic medical data structuring
allergy
intolerance
machine learning
unstructured text analysis
interoperability
url https://ntv.ifmo.ru/file/article/20517.pdf
work_keys_str_mv AT iuliiadlenivtceva automaticallergyclassificationbasedonrussianunstructuredmedicaltexts
AT georgydkopanitsa automaticallergyclassificationbasedonrussianunstructuredmedicaltexts