Applying Convolutional Neural Networks to Predict the ICD-9 Codes of Medical Records

Bibliographic Details
Main Authors: Jia-Lien Hsu, Teng-Jie Hsu, Chung-Ho Hsieh, Anandakumar Singaravelan
Format: Article
Language: English
Published: MDPI AG 2020-12-01
Series: Sensors
Subjects: diagnosis code prediction; convolutional neural network; ICD-9; medical record
Online Access: https://www.mdpi.com/1424-8220/20/24/7116
_version_ 1827700235398283264
author Jia-Lien Hsu
Teng-Jie Hsu
Chung-Ho Hsieh
Anandakumar Singaravelan
collection DOAJ
description The International Statistical Classification of Diseases and Related Health Problems (ICD) is an international standard system for categorizing and reporting diseases, injuries, disorders, and health conditions. Most previously proposed disease prediction systems require clinical information collected by medical staff from patients in hospitals. In this paper, we propose a deep learning algorithm to classify disease types and identify diagnostic codes by using only the subjective component of progress notes in medical records. Our dataset consists of about 168,000 medical records from a medical center, collected between 2003 and 2017. First, we apply standard text processing procedures to parse the sentences and word embedding techniques to obtain vector representations. Next, we build a convolutional neural network model on the medical records to predict the ICD-9 code from the subjective component of the progress note. The prediction performance is evaluated by ten-fold cross-validation and yields an accuracy of 0.409, a recall of 0.409, and a precision of 0.436. If we consider only the “chapter match” of the ICD-9 code, our model achieves an accuracy of 0.580, a recall of 0.580, and a precision of 0.582. Since our diagnostic code prediction model is based solely on subjective components (mainly patients’ self-reported descriptions), the proposed approach could serve as a remote self-diagnosis assistance tool prior to seeking medical advice or going to the hospital. In addition, our work may be used as a primary evaluation tool for discomfort in rural areas where medical resources are limited.
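The "chapter match" figure in the description can be illustrated with a minimal sketch. It assumes that a prediction counts as a chapter match when the predicted and true codes fall in the same ICD-9-CM chapter, which is one plausible reading of the abstract and not a confirmed detail of the authors' evaluation. The function names `icd9_chapter` and `match_rates`, the shortened chapter labels, and the toy codes in the usage note are hypothetical.

```python
# Sketch only: the helper names and the toy data are illustrative assumptions,
# not the authors' code. Chapter ranges follow the standard ICD-9-CM layout.
ICD9_CHAPTERS = [
    (1, 139, "infectious and parasitic"),
    (140, 239, "neoplasms"),
    (240, 279, "endocrine and metabolic"),
    (280, 289, "blood"),
    (290, 319, "mental disorders"),
    (320, 389, "nervous system and sense organs"),
    (390, 459, "circulatory"),
    (460, 519, "respiratory"),
    (520, 579, "digestive"),
    (580, 629, "genitourinary"),
    (630, 679, "pregnancy and childbirth"),
    (680, 709, "skin"),
    (710, 739, "musculoskeletal"),
    (740, 759, "congenital anomalies"),
    (760, 779, "perinatal conditions"),
    (780, 799, "symptoms and ill-defined conditions"),
    (800, 999, "injury and poisoning"),
]


def icd9_chapter(code: str) -> str:
    """Map an ICD-9 code such as '250.00' or 'V70.0' to a chapter label."""
    code = code.strip().upper()
    if code.startswith("V"):
        return "supplementary (V codes)"
    if code.startswith("E"):
        return "external causes (E codes)"
    # The three-digit category before the decimal point decides the chapter.
    category = int(code.split(".")[0][:3])
    for low, high, name in ICD9_CHAPTERS:
        if low <= category <= high:
            return name
    raise ValueError(f"not a recognized ICD-9 code: {code!r}")


def match_rates(true_codes, predicted_codes):
    """Return (exact-match rate, chapter-match rate) over paired codes."""
    pairs = list(zip(true_codes, predicted_codes))
    if not pairs:
        raise ValueError("no code pairs to compare")
    exact = sum(t == p for t, p in pairs) / len(pairs)
    chapter = sum(icd9_chapter(t) == icd9_chapter(p) for t, p in pairs) / len(pairs)
    return exact, chapter
```

For example, `match_rates(["250.00", "401.9", "486"], ["250.00", "410.9", "530.81"])` compares three invented pairs: the first is an exact match, the second only shares the circulatory chapter, and the third matches at neither level. The chapter-level rate can never be lower than the exact rate, which is consistent with the higher chapter-match accuracy reported above.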
first_indexed 2024-03-10T14:07:28Z
format Article
id doaj.art-12d91046fca944b49eded4022143079c
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-10T14:07:28Z
publishDate 2020-12-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-12d91046fca944b49eded4022143079c
2023-11-21T00:27:07Z
eng
MDPI AG
Sensors
1424-8220
2020-12-01
Volume 20, Issue 24, Article 7116
10.3390/s20247116
Applying Convolutional Neural Networks to Predict the ICD-9 Codes of Medical Records
Jia-Lien Hsu (Department of Computer Science and Information Engineering, Fu Jen Catholic University, New Taipei City 242062, Taiwan)
Teng-Jie Hsu (Department of Computer Science and Information Engineering, Fu Jen Catholic University, New Taipei City 242062, Taiwan)
Chung-Ho Hsieh (Department of General Surgery, Shin Kong Wu Ho-Su Memorial Hospital, Taipei 111045, Taiwan)
Anandakumar Singaravelan (Graduate Institute of Applied Science and Engineering, Fu Jen Catholic University, New Taipei City 242062, Taiwan)
https://www.mdpi.com/1424-8220/20/24/7116
diagnosis code prediction
convolutional neural network
ICD-9
medical record
title Applying Convolutional Neural Networks to Predict the ICD-9 Codes of Medical Records
topic diagnosis code prediction
convolutional neural network
ICD-9
medical record
url https://www.mdpi.com/1424-8220/20/24/7116