Length biases in single-cell RNA sequencing of pre-mRNA

Single-cell RNA sequencing data can be modeled using Markov chains to yield genome-wide insights into transcriptional physics. However, quantitative inference with such data requires careful assessment of noise sources. We find that long pre-mRNA transcripts are over-represented in sequencing data....

Full description

Bibliographic Details
Main Authors: Gennady Gorin, Lior Pachter
Format: Article
Language:English
Published: Elsevier 2023-03-01
Series:Biophysical Reports
Online Access:http://www.sciencedirect.com/science/article/pii/S2667074722000544
Description
Summary:Single-cell RNA sequencing data can be modeled using Markov chains to yield genome-wide insights into transcriptional physics. However, quantitative inference with such data requires careful assessment of noise sources. We find that long pre-mRNA transcripts are over-represented in sequencing data. To explain this trend, we propose a length-based model of capture bias, which may produce false-positive observations. We solve this model and use it to find concordant parameter trends as well as systematic, mechanistically interpretable technical and biological differences in paired data sets.
ISSN:2667-0747