A compressed large language model embedding dataset of ICD 10 CM descriptions
Abstract This paper presents novel datasets providing numerical representations of ICD-10-CM codes by generating description embeddings using a large language model followed by a dimension reduction via autoencoder. The embeddings serve as informative input features for machine learning models by ca...
Main Authors: | Michael J. Kane, Casey King, Denise Esserman, Nancy K. Latham, Erich J. Greene, David A. Ganz |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2023-12-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12859-023-05597-2 |
Similar Items
-
Impact of the Transition from ICD-9-CM to ICD-10-CM on the Rates of Severe Maternal Morbidity in Arkansas: An Analysis of Claims Data
by: Mandana Rezaeiahari, et al.
Published: (2022-05-01) -
Validation of ICD-9-CM and ICD-10-CM Diagnostic Codes for Identifying Patients with Out-of-Hospital Cardiac Arrest in a National Health Insurance Claims Database
by: Tsai MJ, et al.
Published: (2022-05-01) -
The effect of transitioning to ICD-10-CM on acute injury surveillance of active duty service members
by: Matthew C. Inscore, et al.
Published: (2018-08-01) -
Diagnosis Coding for Clinicians: Core Knowledge and Transition to ICD-10
by: Davoren Chick, et al.
Published: (2014-06-01) -
Impact of ICD-9-CM to ICD-10-CM coding transition on trauma hospitalization trends among young adults in 12 states
by: Yuri V. Sebastião, et al.
Published: (2021-01-01)