The Impact of Collisions on the Ability to Detect Rare Mutant Alleles Using Barcode-Type Next-Generation Sequencing Techniques

Barcoding techniques are used to reduce error from next-generation sequencing, with applications ranging from understanding tumor subclone populations to detecting circulating tumor DNA. Collisions occur when more than one sample molecule is tagged by the same unique identifier (UID) and can result...

Full description

Bibliographic Details
Main Authors: Jenna VanLiere Canzoniero, Karen Cravero, Ben Ho Park
Format: Article
Language:English
Published: SAGE Publishing 2017-07-01
Series:Cancer Informatics
Online Access:https://doi.org/10.1177/1176935117719236
_version_ 1831614291346718720
author Jenna VanLiere Canzoniero
Karen Cravero
Ben Ho Park
author_facet Jenna VanLiere Canzoniero
Karen Cravero
Ben Ho Park
author_sort Jenna VanLiere Canzoniero
collection DOAJ
description Barcoding techniques are used to reduce error from next-generation sequencing, with applications ranging from understanding tumor subclone populations to detecting circulating tumor DNA. Collisions occur when more than one sample molecule is tagged by the same unique identifier (UID) and can result in failure to detect very-low-frequency mutations and error in estimating mutation frequency. Here, we created computer models of barcoding technique, with and without amplification bias introduced by the UID, and analyzed the effect of collisions for a range of mutant allele frequencies (1e−6 to 0.2), number of sample molecules (10 000 to 1e7), and number of UIDs (4 10 -4 14 ). Inability to detect rare mutant alleles occurred in 0% to 100% of simulations, depending on collisions and number of mutant molecules. Collisions also introduced error in estimating mutant allele frequency resulting in underestimation of minor allele frequency. Incorporating an understanding of the effect of collisions into experimental design can allow for optimization of the number of sample molecules and number of UIDs to minimize the negative impact on rare mutant detection and mutant frequency estimation.
first_indexed 2024-12-18T19:26:39Z
format Article
id doaj.art-0d334c66506648a1bbe8621b84b43f6c
institution Directory Open Access Journal
issn 1176-9351
language English
last_indexed 2024-12-18T19:26:39Z
publishDate 2017-07-01
publisher SAGE Publishing
record_format Article
series Cancer Informatics
spelling doaj.art-0d334c66506648a1bbe8621b84b43f6c2022-12-21T20:55:51ZengSAGE PublishingCancer Informatics1176-93512017-07-011610.1177/1176935117719236The Impact of Collisions on the Ability to Detect Rare Mutant Alleles Using Barcode-Type Next-Generation Sequencing TechniquesJenna VanLiere Canzoniero0Karen Cravero1Ben Ho Park2Division of General Internal Medicine, Johns Hopkins Medicine, Baltimore, MD, USAThe Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins Medicine, Baltimore, MD, USAThe Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins Medicine, Baltimore, MD, USABarcoding techniques are used to reduce error from next-generation sequencing, with applications ranging from understanding tumor subclone populations to detecting circulating tumor DNA. Collisions occur when more than one sample molecule is tagged by the same unique identifier (UID) and can result in failure to detect very-low-frequency mutations and error in estimating mutation frequency. Here, we created computer models of barcoding technique, with and without amplification bias introduced by the UID, and analyzed the effect of collisions for a range of mutant allele frequencies (1e−6 to 0.2), number of sample molecules (10 000 to 1e7), and number of UIDs (4 10 -4 14 ). Inability to detect rare mutant alleles occurred in 0% to 100% of simulations, depending on collisions and number of mutant molecules. Collisions also introduced error in estimating mutant allele frequency resulting in underestimation of minor allele frequency. Incorporating an understanding of the effect of collisions into experimental design can allow for optimization of the number of sample molecules and number of UIDs to minimize the negative impact on rare mutant detection and mutant frequency estimation.https://doi.org/10.1177/1176935117719236
spellingShingle Jenna VanLiere Canzoniero
Karen Cravero
Ben Ho Park
The Impact of Collisions on the Ability to Detect Rare Mutant Alleles Using Barcode-Type Next-Generation Sequencing Techniques
Cancer Informatics
title The Impact of Collisions on the Ability to Detect Rare Mutant Alleles Using Barcode-Type Next-Generation Sequencing Techniques
title_full The Impact of Collisions on the Ability to Detect Rare Mutant Alleles Using Barcode-Type Next-Generation Sequencing Techniques
title_fullStr The Impact of Collisions on the Ability to Detect Rare Mutant Alleles Using Barcode-Type Next-Generation Sequencing Techniques
title_full_unstemmed The Impact of Collisions on the Ability to Detect Rare Mutant Alleles Using Barcode-Type Next-Generation Sequencing Techniques
title_short The Impact of Collisions on the Ability to Detect Rare Mutant Alleles Using Barcode-Type Next-Generation Sequencing Techniques
title_sort impact of collisions on the ability to detect rare mutant alleles using barcode type next generation sequencing techniques
url https://doi.org/10.1177/1176935117719236
work_keys_str_mv AT jennavanlierecanzoniero theimpactofcollisionsontheabilitytodetectraremutantallelesusingbarcodetypenextgenerationsequencingtechniques
AT karencravero theimpactofcollisionsontheabilitytodetectraremutantallelesusingbarcodetypenextgenerationsequencingtechniques
AT benhopark theimpactofcollisionsontheabilitytodetectraremutantallelesusingbarcodetypenextgenerationsequencingtechniques
AT jennavanlierecanzoniero impactofcollisionsontheabilitytodetectraremutantallelesusingbarcodetypenextgenerationsequencingtechniques
AT karencravero impactofcollisionsontheabilitytodetectraremutantallelesusingbarcodetypenextgenerationsequencingtechniques
AT benhopark impactofcollisionsontheabilitytodetectraremutantallelesusingbarcodetypenextgenerationsequencingtechniques