DropletQC: improved identification of empty droplets and damaged cells in single-cell RNA-seq data

Abstract Background Advances in droplet-based single-cell RNA-sequencing (scRNA-seq) have dramatically increased throughput, allowing tens of thousands of cells to be routinely sequenced in a single experiment. In addition to cells, droplets capture cell-free “ambient” RNA predominantly caused by ly...

Full description

Bibliographic Details
Main Authors: Walter Muskovic, Joseph E. Powell
Format: Article
Language:English
Published: BMC 2021-12-01
Series:Genome Biology
Online Access:https://doi.org/10.1186/s13059-021-02547-0
_version_ 1818983189542076416
author Walter Muskovic
Joseph E. Powell
author_facet Walter Muskovic
Joseph E. Powell
author_sort Walter Muskovic
collection DOAJ
description Abstract Background Advances in droplet-based single-cell RNA-sequencing (scRNA-seq) have dramatically increased throughput, allowing tens of thousands of cells to be routinely sequenced in a single experiment. In addition to cells, droplets capture cell-free “ambient” RNA predominantly caused by lysis of cells during sample preparation. Samples with high ambient RNA concentration can create challenges in accurately distinguishing cell-containing droplets and droplets containing ambient RNA. Current methods to separate these groups often retain a significant number of droplets that do not contain cells or empty droplets. Additionally, there are currently no methods available to detect droplets containing damaged cells, which comprise partially lysed cells, the original source of the ambient RNA. Results Here, we describe DropletQC, a new method that is able to detect empty droplets, damaged, and intact cells, and accurately distinguish them from one another. This approach is based on a novel quality control metric, the nuclear fraction, which quantifies for each droplet the fraction of RNA originating from unspliced, nuclear pre-mRNA. We demonstrate how DropletQC provides a powerful extension to existing computational methods for identifying empty droplets such as EmptyDrops. Conclusions We implement DropletQC as an R package, which can be easily integrated into existing single-cell analysis workflows.
first_indexed 2024-12-20T17:59:09Z
format Article
id doaj.art-87314464a784446a98f63d2b1ca32ddb
institution Directory Open Access Journal
issn 1474-760X
language English
last_indexed 2024-12-20T17:59:09Z
publishDate 2021-12-01
publisher BMC
record_format Article
series Genome Biology
spelling doaj.art-87314464a784446a98f63d2b1ca32ddb2022-12-21T19:30:41ZengBMCGenome Biology1474-760X2021-12-012211910.1186/s13059-021-02547-0DropletQC: improved identification of empty droplets and damaged cells in single-cell RNA-seq dataWalter Muskovic0Joseph E. Powell1Garvan Weizmann Centre for Cellular Genomics, Garvan Institute of Medical Research, The Kinghorn Cancer CentreGarvan Weizmann Centre for Cellular Genomics, Garvan Institute of Medical Research, The Kinghorn Cancer CentreAbstract Background Advances in droplet-based single-cell RNA-sequencing (scRNA-seq) have dramatically increased throughput, allowing tens of thousands of cells to be routinely sequenced in a single experiment. In addition to cells, droplets capture cell-free “ambient” RNA predominantly caused by lysis of cells during sample preparation. Samples with high ambient RNA concentration can create challenges in accurately distinguishing cell-containing droplets and droplets containing ambient RNA. Current methods to separate these groups often retain a significant number of droplets that do not contain cells or empty droplets. Additionally, there are currently no methods available to detect droplets containing damaged cells, which comprise partially lysed cells, the original source of the ambient RNA. Results Here, we describe DropletQC, a new method that is able to detect empty droplets, damaged, and intact cells, and accurately distinguish them from one another. This approach is based on a novel quality control metric, the nuclear fraction, which quantifies for each droplet the fraction of RNA originating from unspliced, nuclear pre-mRNA. We demonstrate how DropletQC provides a powerful extension to existing computational methods for identifying empty droplets such as EmptyDrops. Conclusions We implement DropletQC as an R package, which can be easily integrated into existing single-cell analysis workflows.https://doi.org/10.1186/s13059-021-02547-0
spellingShingle Walter Muskovic
Joseph E. Powell
DropletQC: improved identification of empty droplets and damaged cells in single-cell RNA-seq data
Genome Biology
title DropletQC: improved identification of empty droplets and damaged cells in single-cell RNA-seq data
title_full DropletQC: improved identification of empty droplets and damaged cells in single-cell RNA-seq data
title_fullStr DropletQC: improved identification of empty droplets and damaged cells in single-cell RNA-seq data
title_full_unstemmed DropletQC: improved identification of empty droplets and damaged cells in single-cell RNA-seq data
title_short DropletQC: improved identification of empty droplets and damaged cells in single-cell RNA-seq data
title_sort dropletqc improved identification of empty droplets and damaged cells in single cell rna seq data
url https://doi.org/10.1186/s13059-021-02547-0
work_keys_str_mv AT waltermuskovic dropletqcimprovedidentificationofemptydropletsanddamagedcellsinsinglecellrnaseqdata
AT josephepowell dropletqcimprovedidentificationofemptydropletsanddamagedcellsinsinglecellrnaseqdata