scMTD: a statistical multidimensional imputation method for single-cell RNA-seq data leveraging transcriptome dynamic information

Abstract Background Single-cell RNA sequencing (scRNA-seq) provides a powerful tool to capture transcriptomes at single-cell resolution. However, dropout events distort the gene expression levels and underlying biological signals, misleading the downstream analysis of scRNA-seq data. Results We deve...

Full description

Bibliographic Details
Main Authors: Jing Qi, Qiongyu Sheng, Yang Zhou, Jiao Hua, Shutong Xiao, Shuilin Jin
Format: Article
Language:English
Published: BMC 2022-09-01
Series:Cell & Bioscience
Subjects:
Online Access:https://doi.org/10.1186/s13578-022-00886-4
_version_ 1811274863752511488
author Jing Qi
Qiongyu Sheng
Yang Zhou
Jiao Hua
Shutong Xiao
Shuilin Jin
author_facet Jing Qi
Qiongyu Sheng
Yang Zhou
Jiao Hua
Shutong Xiao
Shuilin Jin
author_sort Jing Qi
collection DOAJ
description Abstract Background Single-cell RNA sequencing (scRNA-seq) provides a powerful tool to capture transcriptomes at single-cell resolution. However, dropout events distort the gene expression levels and underlying biological signals, misleading the downstream analysis of scRNA-seq data. Results We develop a statistical model-based multidimensional imputation algorithm, scMTD, that identifies local cell neighbors and specific gene co-expression networks based on the pseudo-time of cells, leveraging information on cell-level, gene-level, and transcriptome dynamic to recover scRNA-seq data. Compared with the state-of-the-art imputation methods through several real-data-based analytical experiments, scMTD effectively recovers biological signals of transcriptomes and consistently outperforms the other algorithms in improving FISH validation, trajectory inference, differential expression analysis, clustering analysis, and identification of cell types. Conclusions scMTD maintains the gene expression characteristics, enhances the clustering of cell subpopulations, assists the study of gene expression dynamics, contributes to the discovery of rare cell types, and applies to both UMI-based and non-UMI-based data. Overall, scMTD’s reliability, applicability, and scalability make it a promising imputation approach for scRNA-seq data.
first_indexed 2024-04-12T23:26:46Z
format Article
id doaj.art-1b04e9d0009e4ff78e44a821f05a4a1b
institution Directory Open Access Journal
issn 2045-3701
language English
last_indexed 2024-04-12T23:26:46Z
publishDate 2022-09-01
publisher BMC
record_format Article
series Cell & Bioscience
spelling doaj.art-1b04e9d0009e4ff78e44a821f05a4a1b2022-12-22T03:12:23ZengBMCCell & Bioscience2045-37012022-09-0112111510.1186/s13578-022-00886-4scMTD: a statistical multidimensional imputation method for single-cell RNA-seq data leveraging transcriptome dynamic informationJing Qi0Qiongyu Sheng1Yang Zhou2Jiao Hua3Shutong Xiao4Shuilin Jin5School of Mathematics, Harbin Institute of TechnologySchool of Mathematics, Harbin Institute of TechnologySchool of Mathematics, Harbin Institute of TechnologySchool of Mathematics, Harbin Institute of TechnologySchool of Mathematics, Harbin Institute of TechnologySchool of Mathematics, Harbin Institute of TechnologyAbstract Background Single-cell RNA sequencing (scRNA-seq) provides a powerful tool to capture transcriptomes at single-cell resolution. However, dropout events distort the gene expression levels and underlying biological signals, misleading the downstream analysis of scRNA-seq data. Results We develop a statistical model-based multidimensional imputation algorithm, scMTD, that identifies local cell neighbors and specific gene co-expression networks based on the pseudo-time of cells, leveraging information on cell-level, gene-level, and transcriptome dynamic to recover scRNA-seq data. Compared with the state-of-the-art imputation methods through several real-data-based analytical experiments, scMTD effectively recovers biological signals of transcriptomes and consistently outperforms the other algorithms in improving FISH validation, trajectory inference, differential expression analysis, clustering analysis, and identification of cell types. Conclusions scMTD maintains the gene expression characteristics, enhances the clustering of cell subpopulations, assists the study of gene expression dynamics, contributes to the discovery of rare cell types, and applies to both UMI-based and non-UMI-based data. Overall, scMTD’s reliability, applicability, and scalability make it a promising imputation approach for scRNA-seq data.https://doi.org/10.1186/s13578-022-00886-4Single-cell RNA-seqMultidimensional informationTranscriptome dynamicCell-levelGene-level
spellingShingle Jing Qi
Qiongyu Sheng
Yang Zhou
Jiao Hua
Shutong Xiao
Shuilin Jin
scMTD: a statistical multidimensional imputation method for single-cell RNA-seq data leveraging transcriptome dynamic information
Cell & Bioscience
Single-cell RNA-seq
Multidimensional information
Transcriptome dynamic
Cell-level
Gene-level
title scMTD: a statistical multidimensional imputation method for single-cell RNA-seq data leveraging transcriptome dynamic information
title_full scMTD: a statistical multidimensional imputation method for single-cell RNA-seq data leveraging transcriptome dynamic information
title_fullStr scMTD: a statistical multidimensional imputation method for single-cell RNA-seq data leveraging transcriptome dynamic information
title_full_unstemmed scMTD: a statistical multidimensional imputation method for single-cell RNA-seq data leveraging transcriptome dynamic information
title_short scMTD: a statistical multidimensional imputation method for single-cell RNA-seq data leveraging transcriptome dynamic information
title_sort scmtd a statistical multidimensional imputation method for single cell rna seq data leveraging transcriptome dynamic information
topic Single-cell RNA-seq
Multidimensional information
Transcriptome dynamic
Cell-level
Gene-level
url https://doi.org/10.1186/s13578-022-00886-4
work_keys_str_mv AT jingqi scmtdastatisticalmultidimensionalimputationmethodforsinglecellrnaseqdataleveragingtranscriptomedynamicinformation
AT qiongyusheng scmtdastatisticalmultidimensionalimputationmethodforsinglecellrnaseqdataleveragingtranscriptomedynamicinformation
AT yangzhou scmtdastatisticalmultidimensionalimputationmethodforsinglecellrnaseqdataleveragingtranscriptomedynamicinformation
AT jiaohua scmtdastatisticalmultidimensionalimputationmethodforsinglecellrnaseqdataleveragingtranscriptomedynamicinformation
AT shutongxiao scmtdastatisticalmultidimensionalimputationmethodforsinglecellrnaseqdataleveragingtranscriptomedynamicinformation
AT shuilinjin scmtdastatisticalmultidimensionalimputationmethodforsinglecellrnaseqdataleveragingtranscriptomedynamicinformation