Comparative visual analytics for assessing medical records with sequence embedding

Machine learning for data-driven diagnosis has been actively studied in medicine to provide better healthcare. Supporting analysis of a patient cohort similar to a patient under treatment is a key task for clinicians to make decisions with high confidence. However, such analysis is not straightforwa...

Full description

Bibliographic Details
Main Authors: Rongchen Guo, Takanori Fujiwara, Yiran Li, Kelly M. Lima, Soman Sen, Nam K. Tran, Kwan-Liu Ma
Format: Article
Language:English
Published: Elsevier 2020-06-01
Series:Visual Informatics
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2468502X20300139
_version_ 1819041271583342592
author Rongchen Guo
Takanori Fujiwara
Yiran Li
Kelly M. Lima
Soman Sen
Nam K. Tran
Kwan-Liu Ma
author_facet Rongchen Guo
Takanori Fujiwara
Yiran Li
Kelly M. Lima
Soman Sen
Nam K. Tran
Kwan-Liu Ma
author_sort Rongchen Guo
collection DOAJ
description Machine learning for data-driven diagnosis has been actively studied in medicine to provide better healthcare. Supporting analysis of a patient cohort similar to a patient under treatment is a key task for clinicians to make decisions with high confidence. However, such analysis is not straightforward due to the characteristics of medical records: high dimensionality, irregularity in time, and sparsity. To address this challenge, we introduce a method for similarity calculation of medical records. Our method employs event and sequence embeddings. While we use an autoencoder for the event embedding, we apply its variant with the self-attention mechanism for the sequence embedding. Moreover, in order to better handle the irregularity of data, we enhance the self-attention mechanism with consideration of different time intervals. We have developed a visual analytics system to support comparative studies of patient records. To make a comparison of sequences with different lengths easier, our system incorporates a sequence alignment method. Through its interactive interface, the user can quickly identify patients of interest and conveniently review both the temporal and multivariate aspects of the patient records. We demonstrate the effectiveness of our design and system with case studies using a real-world dataset from the neonatal intensive care unit of UC Davis.
first_indexed 2024-12-21T09:22:21Z
format Article
id doaj.art-d1a6a204968d4e85b31e2655bdb6de30
institution Directory Open Access Journal
issn 2468-502X
language English
last_indexed 2024-12-21T09:22:21Z
publishDate 2020-06-01
publisher Elsevier
record_format Article
series Visual Informatics
spelling doaj.art-d1a6a204968d4e85b31e2655bdb6de302022-12-21T19:08:59ZengElsevierVisual Informatics2468-502X2020-06-01427285Comparative visual analytics for assessing medical records with sequence embeddingRongchen Guo0Takanori Fujiwara1Yiran Li2Kelly M. Lima3Soman Sen4Nam K. Tran5Kwan-Liu Ma6Department of Computer Science, Beihang University, Beijing, China; Corresponding author.Department of Computer Science, University of California, Davis, United StatesDepartment of Computer Science, University of California, Davis, United StatesDepartment of Pathology and Laboratory Medicine, University of California, Davis, United StatesDepartment of Surgery, University of California, Davis, United StatesDepartment of Pathology and Laboratory Medicine, University of California, Davis, United StatesDepartment of Computer Science, University of California, Davis, United StatesMachine learning for data-driven diagnosis has been actively studied in medicine to provide better healthcare. Supporting analysis of a patient cohort similar to a patient under treatment is a key task for clinicians to make decisions with high confidence. However, such analysis is not straightforward due to the characteristics of medical records: high dimensionality, irregularity in time, and sparsity. To address this challenge, we introduce a method for similarity calculation of medical records. Our method employs event and sequence embeddings. While we use an autoencoder for the event embedding, we apply its variant with the self-attention mechanism for the sequence embedding. Moreover, in order to better handle the irregularity of data, we enhance the self-attention mechanism with consideration of different time intervals. We have developed a visual analytics system to support comparative studies of patient records. To make a comparison of sequences with different lengths easier, our system incorporates a sequence alignment method. Through its interactive interface, the user can quickly identify patients of interest and conveniently review both the temporal and multivariate aspects of the patient records. We demonstrate the effectiveness of our design and system with case studies using a real-world dataset from the neonatal intensive care unit of UC Davis.http://www.sciencedirect.com/science/article/pii/S2468502X20300139Electronic medical recordsEvent sequence dataAutoencoderSelf-attentionSequence similarityVisual analytics
spellingShingle Rongchen Guo
Takanori Fujiwara
Yiran Li
Kelly M. Lima
Soman Sen
Nam K. Tran
Kwan-Liu Ma
Comparative visual analytics for assessing medical records with sequence embedding
Visual Informatics
Electronic medical records
Event sequence data
Autoencoder
Self-attention
Sequence similarity
Visual analytics
title Comparative visual analytics for assessing medical records with sequence embedding
title_full Comparative visual analytics for assessing medical records with sequence embedding
title_fullStr Comparative visual analytics for assessing medical records with sequence embedding
title_full_unstemmed Comparative visual analytics for assessing medical records with sequence embedding
title_short Comparative visual analytics for assessing medical records with sequence embedding
title_sort comparative visual analytics for assessing medical records with sequence embedding
topic Electronic medical records
Event sequence data
Autoencoder
Self-attention
Sequence similarity
Visual analytics
url http://www.sciencedirect.com/science/article/pii/S2468502X20300139
work_keys_str_mv AT rongchenguo comparativevisualanalyticsforassessingmedicalrecordswithsequenceembedding
AT takanorifujiwara comparativevisualanalyticsforassessingmedicalrecordswithsequenceembedding
AT yiranli comparativevisualanalyticsforassessingmedicalrecordswithsequenceembedding
AT kellymlima comparativevisualanalyticsforassessingmedicalrecordswithsequenceembedding
AT somansen comparativevisualanalyticsforassessingmedicalrecordswithsequenceembedding
AT namktran comparativevisualanalyticsforassessingmedicalrecordswithsequenceembedding
AT kwanliuma comparativevisualanalyticsforassessingmedicalrecordswithsequenceembedding