DiMSum: an error model and pipeline for analyzing deep mutational scanning data and diagnosing common experimental pathologies

Abstract Deep mutational scanning (DMS) enables multiplexed measurement of the effects of thousands of variants of proteins, RNAs, and regulatory elements. Here, we present a customizable pipeline, DiMSum, that represents an end-to-end solution for obtaining variant fitness and error estimates from...

Full description

Bibliographic Details
Main Authors: Andre J. Faure, Jörn M. Schmiedel, Pablo Baeza-Centurion, Ben Lehner
Format: Article
Language:English
Published: BMC 2020-08-01
Series:Genome Biology
Subjects:
Online Access:http://link.springer.com/article/10.1186/s13059-020-02091-3
Description
Summary:Abstract Deep mutational scanning (DMS) enables multiplexed measurement of the effects of thousands of variants of proteins, RNAs, and regulatory elements. Here, we present a customizable pipeline, DiMSum, that represents an end-to-end solution for obtaining variant fitness and error estimates from raw sequencing data. A key innovation of DiMSum is the use of an interpretable error model that captures the main sources of variability arising in DMS workflows, outperforming previous methods. DiMSum is available as an R/Bioconda package and provides summary reports to help researchers diagnose common DMS pathologies and take remedial steps in their analyses.
ISSN:1474-760X