Automatic ICD-10 coding algorithm using an improved longest common subsequence based on semantic similarity.

ICD-10(International Classification of Diseases 10th revision) is a classification of a disease, symptom, procedure, or injury. Diseases are often described in patients' medical records with free texts, such as terms, phrases and paraphrases, which differ significantly from those used in ICD-10...

Full description

Bibliographic Details
Main Authors: YunZhi Chen, HuiJuan Lu, LanJuan Li
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2017-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC5356997?pdf=render
_version_ 1818270186552164352
author YunZhi Chen
HuiJuan Lu
LanJuan Li
author_facet YunZhi Chen
HuiJuan Lu
LanJuan Li
author_sort YunZhi Chen
collection DOAJ
description ICD-10(International Classification of Diseases 10th revision) is a classification of a disease, symptom, procedure, or injury. Diseases are often described in patients' medical records with free texts, such as terms, phrases and paraphrases, which differ significantly from those used in ICD-10 classification. This paper presents an improved approach based on the Longest Common Subsequence (LCS) and semantic similarity for automatic Chinese diagnoses, mapping from the disease names given by clinician to the disease names in ICD-10. LCS refers to the longest string that is a subsequence of every member of a given set of strings. The proposed method of improved LCS in this paper can increase the accuracy of processing in Chinese disease mapping.
first_indexed 2024-12-12T21:06:17Z
format Article
id doaj.art-ba0a2410ca484da484fd22dd0b06478c
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-12T21:06:17Z
publishDate 2017-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-ba0a2410ca484da484fd22dd0b06478c2022-12-22T00:12:01ZengPublic Library of Science (PLoS)PLoS ONE1932-62032017-01-01123e017341010.1371/journal.pone.0173410Automatic ICD-10 coding algorithm using an improved longest common subsequence based on semantic similarity.YunZhi ChenHuiJuan LuLanJuan LiICD-10(International Classification of Diseases 10th revision) is a classification of a disease, symptom, procedure, or injury. Diseases are often described in patients' medical records with free texts, such as terms, phrases and paraphrases, which differ significantly from those used in ICD-10 classification. This paper presents an improved approach based on the Longest Common Subsequence (LCS) and semantic similarity for automatic Chinese diagnoses, mapping from the disease names given by clinician to the disease names in ICD-10. LCS refers to the longest string that is a subsequence of every member of a given set of strings. The proposed method of improved LCS in this paper can increase the accuracy of processing in Chinese disease mapping.http://europepmc.org/articles/PMC5356997?pdf=render
spellingShingle YunZhi Chen
HuiJuan Lu
LanJuan Li
Automatic ICD-10 coding algorithm using an improved longest common subsequence based on semantic similarity.
PLoS ONE
title Automatic ICD-10 coding algorithm using an improved longest common subsequence based on semantic similarity.
title_full Automatic ICD-10 coding algorithm using an improved longest common subsequence based on semantic similarity.
title_fullStr Automatic ICD-10 coding algorithm using an improved longest common subsequence based on semantic similarity.
title_full_unstemmed Automatic ICD-10 coding algorithm using an improved longest common subsequence based on semantic similarity.
title_short Automatic ICD-10 coding algorithm using an improved longest common subsequence based on semantic similarity.
title_sort automatic icd 10 coding algorithm using an improved longest common subsequence based on semantic similarity
url http://europepmc.org/articles/PMC5356997?pdf=render
work_keys_str_mv AT yunzhichen automaticicd10codingalgorithmusinganimprovedlongestcommonsubsequencebasedonsemanticsimilarity
AT huijuanlu automaticicd10codingalgorithmusinganimprovedlongestcommonsubsequencebasedonsemanticsimilarity
AT lanjuanli automaticicd10codingalgorithmusinganimprovedlongestcommonsubsequencebasedonsemanticsimilarity