Evaluating MedDRA-to-ICD terminology mappings

Abstract Background In this era of big data, data harmonization is an important step to ensure reproducible, scalable, and collaborative research. Thus, terminology mapping is a necessary step to harmonize heterogeneous data. Take the Medical Dictionary for Regulatory Activities (MedDRA) and Interna...

Full description

Bibliographic Details
Main Authors: Xinyuan Zhang, Yixue Feng, Fang Li, Jin Ding, Danyal Tahseen, Ezekiel Hinojosa, Yong Chen, Cui Tao
Format: Article
Language:English
Published: BMC 2024-02-01
Series:BMC Medical Informatics and Decision Making
Subjects:
Online Access:https://doi.org/10.1186/s12911-023-02375-1
_version_ 1797274369631715328
author Xinyuan Zhang
Yixue Feng
Fang Li
Jin Ding
Danyal Tahseen
Ezekiel Hinojosa
Yong Chen
Cui Tao
author_facet Xinyuan Zhang
Yixue Feng
Fang Li
Jin Ding
Danyal Tahseen
Ezekiel Hinojosa
Yong Chen
Cui Tao
author_sort Xinyuan Zhang
collection DOAJ
description Abstract Background In this era of big data, data harmonization is an important step to ensure reproducible, scalable, and collaborative research. Thus, terminology mapping is a necessary step to harmonize heterogeneous data. Take the Medical Dictionary for Regulatory Activities (MedDRA) and International Classification of Diseases (ICD) for example, the mapping between them is essential for drug safety and pharmacovigilance research. Our main objective is to provide a quantitative and qualitative analysis of the mapping status between MedDRA and ICD. We focus on evaluating the current mapping status between MedDRA and ICD through the Unified Medical Language System (UMLS) and Observational Medical Outcomes Partnership Common Data Model (OMOP CDM). We summarized the current mapping statistics and evaluated the quality of the current MedDRA-ICD mapping; for unmapped terms, we used our self-developed algorithm to rank the best possible mapping candidates for additional mapping coverage. Results The identified MedDRA-ICD mapped pairs cover 27.23% of the overall MedDRA preferred terms (PT). The systematic quality analysis demonstrated that, among the mapped pairs provided by UMLS, only 51.44% are considered an exact match. For the 2400 sampled unmapped terms, 56 of the 2400 MedDRA Preferred Terms (PT) could have exact match terms from ICD. Conclusion Some of the mapped pairs between MedDRA and ICD are not exact matches due to differences in granularity and focus. For 72% of the unmapped PT terms, the identified exact match pairs illustrate the possibility of identifying additional mapped pairs. Referring to its own mapping standard, some of the unmapped terms should qualify for the expansion of MedDRA to ICD mapping in UMLS.
first_indexed 2024-03-07T14:57:28Z
format Article
id doaj.art-68d5306c90bf4b5093e8e5c64eebea74
institution Directory Open Access Journal
issn 1472-6947
language English
last_indexed 2024-03-07T14:57:28Z
publishDate 2024-02-01
publisher BMC
record_format Article
series BMC Medical Informatics and Decision Making
spelling doaj.art-68d5306c90bf4b5093e8e5c64eebea742024-03-05T19:20:03ZengBMCBMC Medical Informatics and Decision Making1472-69472024-02-0123S411210.1186/s12911-023-02375-1Evaluating MedDRA-to-ICD terminology mappingsXinyuan Zhang0Yixue Feng1Fang Li2Jin Ding3Danyal Tahseen4Ezekiel Hinojosa5Yong Chen6Cui Tao7McWilliam School of Biomedical Informatics, University of Texas Health Science Center at HoustonSchool of Engineering and Applied Science, University of PennsylvaniaMcWilliam School of Biomedical Informatics, University of Texas Health Science Center at HoustonMcWilliam School of Biomedical Informatics, University of Texas Health Science Center at HoustonMcGovern Medical School, University of Texas Health Science Center at HoustonMcGovern Medical School, University of Texas Health Science Center at HoustonThe Perelman School of Medicine, University of PennsylvaniaMcWilliam School of Biomedical Informatics, University of Texas Health Science Center at HoustonAbstract Background In this era of big data, data harmonization is an important step to ensure reproducible, scalable, and collaborative research. Thus, terminology mapping is a necessary step to harmonize heterogeneous data. Take the Medical Dictionary for Regulatory Activities (MedDRA) and International Classification of Diseases (ICD) for example, the mapping between them is essential for drug safety and pharmacovigilance research. Our main objective is to provide a quantitative and qualitative analysis of the mapping status between MedDRA and ICD. We focus on evaluating the current mapping status between MedDRA and ICD through the Unified Medical Language System (UMLS) and Observational Medical Outcomes Partnership Common Data Model (OMOP CDM). We summarized the current mapping statistics and evaluated the quality of the current MedDRA-ICD mapping; for unmapped terms, we used our self-developed algorithm to rank the best possible mapping candidates for additional mapping coverage. Results The identified MedDRA-ICD mapped pairs cover 27.23% of the overall MedDRA preferred terms (PT). The systematic quality analysis demonstrated that, among the mapped pairs provided by UMLS, only 51.44% are considered an exact match. For the 2400 sampled unmapped terms, 56 of the 2400 MedDRA Preferred Terms (PT) could have exact match terms from ICD. Conclusion Some of the mapped pairs between MedDRA and ICD are not exact matches due to differences in granularity and focus. For 72% of the unmapped PT terms, the identified exact match pairs illustrate the possibility of identifying additional mapped pairs. Referring to its own mapping standard, some of the unmapped terms should qualify for the expansion of MedDRA to ICD mapping in UMLS.https://doi.org/10.1186/s12911-023-02375-1The medical dictionary for regulatory activities (MedDRA)International classification of diseases (ICD)Unified medical language system (UMLS)Observational medical outcomes partnership common data model (OMOP CDM)Terminology mapping
spellingShingle Xinyuan Zhang
Yixue Feng
Fang Li
Jin Ding
Danyal Tahseen
Ezekiel Hinojosa
Yong Chen
Cui Tao
Evaluating MedDRA-to-ICD terminology mappings
BMC Medical Informatics and Decision Making
The medical dictionary for regulatory activities (MedDRA)
International classification of diseases (ICD)
Unified medical language system (UMLS)
Observational medical outcomes partnership common data model (OMOP CDM)
Terminology mapping
title Evaluating MedDRA-to-ICD terminology mappings
title_full Evaluating MedDRA-to-ICD terminology mappings
title_fullStr Evaluating MedDRA-to-ICD terminology mappings
title_full_unstemmed Evaluating MedDRA-to-ICD terminology mappings
title_short Evaluating MedDRA-to-ICD terminology mappings
title_sort evaluating meddra to icd terminology mappings
topic The medical dictionary for regulatory activities (MedDRA)
International classification of diseases (ICD)
Unified medical language system (UMLS)
Observational medical outcomes partnership common data model (OMOP CDM)
Terminology mapping
url https://doi.org/10.1186/s12911-023-02375-1
work_keys_str_mv AT xinyuanzhang evaluatingmeddratoicdterminologymappings
AT yixuefeng evaluatingmeddratoicdterminologymappings
AT fangli evaluatingmeddratoicdterminologymappings
AT jinding evaluatingmeddratoicdterminologymappings
AT danyaltahseen evaluatingmeddratoicdterminologymappings
AT ezekielhinojosa evaluatingmeddratoicdterminologymappings
AT yongchen evaluatingmeddratoicdterminologymappings
AT cuitao evaluatingmeddratoicdterminologymappings