Automatic ICD-10 coding algorithm using an improved longest common subsequence based on semantic similarity.
ICD-10 (International Classification of Diseases, 10th Revision) is a classification of diseases, symptoms, procedures, and injuries. Diseases are often described in patients' medical records in free text, such as terms, phrases, and paraphrases, which differ significantly from those used in ICD-10...
Main Authors: | YunZhi Chen, HuiJuan Lu, LanJuan Li |
---|---|
Format: | Article |
Language: | English |
Published: | Public Library of Science (PLoS), 2017-01-01 |
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC5356997?pdf=render |
_version_ | 1818270186552164352 |
---|---|
author | YunZhi Chen, HuiJuan Lu, LanJuan Li |
author_sort | YunZhi Chen |
collection | DOAJ |
description | ICD-10 (International Classification of Diseases, 10th Revision) is a classification of diseases, symptoms, procedures, and injuries. Diseases are often described in patients' medical records in free text, such as terms, phrases, and paraphrases, which differ significantly from the terms used in the ICD-10 classification. This paper presents an improved approach, based on the Longest Common Subsequence (LCS) and semantic similarity, for automatic coding of Chinese diagnoses: it maps the disease names given by clinicians to the disease names in ICD-10. The LCS is the longest sequence that is a subsequence of every member of a given set of strings. The improved LCS method proposed in this paper can increase the accuracy of Chinese disease mapping. |
first_indexed | 2024-12-12T21:06:17Z |
format | Article |
id | doaj.art-ba0a2410ca484da484fd22dd0b06478c |
institution | Directory of Open Access Journals |
issn | 1932-6203 |
language | English |
last_indexed | 2024-12-12T21:06:17Z |
publishDate | 2017-01-01 |
publisher | Public Library of Science (PLoS) |
record_format | Article |
series | PLoS ONE |
spelling | doaj.art-ba0a2410ca484da484fd22dd0b06478c; 2022-12-22T00:12:01Z; eng; Public Library of Science (PLoS); PLoS ONE; 1932-6203; 2017-01-01; 12(3); e0173410; 10.1371/journal.pone.0173410; Automatic ICD-10 coding algorithm using an improved longest common subsequence based on semantic similarity.; YunZhi Chen; HuiJuan Lu; LanJuan Li; http://europepmc.org/articles/PMC5356997?pdf=render |
title | Automatic ICD-10 coding algorithm using an improved longest common subsequence based on semantic similarity. |
url | http://europepmc.org/articles/PMC5356997?pdf=render |
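
The description above defines the LCS and describes mapping clinician-written disease names to ICD-10 names. As a rough illustration only, the following Python sketch computes the plain (unimproved) LCS of two strings and uses it to pick the closest candidate name. It is not the method from the paper: the semantic-similarity component is omitted, and the function names `lcs_length`, `lcs_similarity`, and `best_match`, as well as the normalization 2 * LCS / (len(a) + len(b)), are assumptions made for this example.

```python
from typing import Iterable, Tuple


def lcs_length(a: str, b: str) -> int:
    """Length of the longest common subsequence of a and b (classic DP)."""
    # prev[j] holds the LCS length of the processed prefix of a and b[:j].
    prev = [0] * (len(b) + 1)
    for ch_a in a:
        curr = [0]
        for j, ch_b in enumerate(b, start=1):
            if ch_a == ch_b:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def lcs_similarity(a: str, b: str) -> float:
    """Assumed normalization (not from the paper): a score in [0, 1]."""
    if not a and not b:
        return 1.0
    return 2 * lcs_length(a, b) / (len(a) + len(b))


def best_match(query: str, candidates: Iterable[str]) -> Tuple[str, float]:
    """Return the candidate most similar to query, with its score.

    Raises ValueError if candidates is empty.
    """
    scored = ((c, lcs_similarity(query, c)) for c in candidates)
    return max(scored, key=lambda pair: pair[1])
```

Because Python strings are sequences of Unicode characters, the same functions operate character by character on Chinese disease names without modification. A purely character-based score cannot tell apart names that share most characters but differ in meaning, which is the kind of gap a semantic-similarity step would need to address.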