ZINBMM: a general mixture model for simultaneous clustering and gene selection using single-cell transcriptomic data

Abstract Clustering is a critical component of single-cell RNA sequencing (scRNA-seq) data analysis and can help reveal cell types and infer cell lineages. Despite considerable successes, there are few methods tailored to investigating cluster-specific genes contributing to cell heterogeneity, which...

Full description

Bibliographic Details
Main Authors: Yang Li, Mingcong Wu, Shuangge Ma, Mengyun Wu
Format: Article
Language:English
Published: BMC 2023-09-01
Series:Genome Biology
Subjects:
Online Access:https://doi.org/10.1186/s13059-023-03046-0
Description
Summary:Abstract Clustering is a critical component of single-cell RNA sequencing (scRNA-seq) data analysis and can help reveal cell types and infer cell lineages. Despite considerable successes, there are few methods tailored to investigating cluster-specific genes contributing to cell heterogeneity, which can promote biological understanding of cell heterogeneity. In this study, we propose a zero-inflated negative binomial mixture model (ZINBMM) that simultaneously achieves effective scRNA-seq data clustering and gene selection. ZINBMM conducts a systemic analysis on raw counts, accommodating both batch effects and dropout events. Simulations and the analysis of five scRNA-seq datasets demonstrate the practical applicability of ZINBMM.
ISSN:1474-760X