CSS: cluster similarity spectrum integration of single-cell genomics data

Abstract It is a major challenge to integrate single-cell sequencing data across experiments, conditions, batches, time points, and other technical considerations. New computational methods are required that can integrate samples while simultaneously preserving biological information. Here, we propo...

Full description

Bibliographic Details
Main Authors: Zhisong He, Agnieska Brazovskaja, Sebastian Ebert, J. Gray Camp, Barbara Treutlein
Format: Article
Language:English
Published: BMC 2020-09-01
Series:Genome Biology
Online Access:http://link.springer.com/article/10.1186/s13059-020-02147-4
Description
Summary:Abstract It is a major challenge to integrate single-cell sequencing data across experiments, conditions, batches, time points, and other technical considerations. New computational methods are required that can integrate samples while simultaneously preserving biological information. Here, we propose an unsupervised reference-free data representation, cluster similarity spectrum (CSS), where each cell is represented by its similarities to clusters independently identified across samples. We show that CSS can be used to assess cellular heterogeneity and enable reconstruction of differentiation trajectories from cerebral organoid and other single-cell transcriptomic data, and to integrate data across experimental conditions and human individuals.
ISSN:1474-760X