Identification and quantification of defective virus genomes in high throughput sequencing data using DVG-profiler, a novel post-sequence alignment processing algorithm.

Most viruses are known to spontaneously generate defective viral genomes (DVG) due to errors during replication. These DVGs are subgenomic and contain deletions that render them unable to complete a full replication cycle in the absence of a co-infecting, non-defective helper virus. DVGs, especially...

Full description

Bibliographic Details
Main Authors: Trent J Bosma, Konstantinos Karagiannis, Luis Santana-Quintero, Natalia Ilyushina, Tatiana Zagorodnyaya, Svetlana Petrovskaya, Majid Laassri, Raymond P Donnelly, Steven Rubin, Vahan Simonyan, Christian J Sauder
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2019-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0216944
_version_ 1819108716955303936
author Trent J Bosma
Konstantinos Karagiannis
Luis Santana-Quintero
Natalia Ilyushina
Tatiana Zagorodnyaya
Svetlana Petrovskaya
Majid Laassri
Raymond P Donnelly
Steven Rubin
Vahan Simonyan
Christian J Sauder
author_facet Trent J Bosma
Konstantinos Karagiannis
Luis Santana-Quintero
Natalia Ilyushina
Tatiana Zagorodnyaya
Svetlana Petrovskaya
Majid Laassri
Raymond P Donnelly
Steven Rubin
Vahan Simonyan
Christian J Sauder
author_sort Trent J Bosma
collection DOAJ
description Most viruses are known to spontaneously generate defective viral genomes (DVG) due to errors during replication. These DVGs are subgenomic and contain deletions that render them unable to complete a full replication cycle in the absence of a co-infecting, non-defective helper virus. DVGs, especially of the copyback type, frequently observed with paramyxoviruses, have been recognized to be important triggers of the antiviral innate immune response. DVGs have therefore gained interest for their potential to alter the attenuation and immunogenicity of vaccines. To investigate this potential, accurate identification and quantification of DVGs is essential. Conventional methods, such as RT-PCR, are labor intensive and will only detect primer sequence-specific species. High throughput sequencing (HTS) is much better suited for this undertaking. Here, we present an HTS-based algorithm called DVG-profiler to identify and quantify all DVG sequences in an HTS data set generated from a virus preparation. DVG-profiler identifies DVG breakpoints relative to a reference genome and reports the directionality of each segment from within the same read. The specificity and sensitivity of the algorithm was assessed using both in silico data sets as well as HTS data obtained from parainfluenza virus 5, Sendai virus and mumps virus preparations. HTS data from the latter were also compared with conventional RT-PCR data and with data obtained using an alternative algorithm. The data presented here demonstrate the high specificity, sensitivity, and robustness of DVG-profiler. This algorithm was implemented within an open source cloud-based computing environment for analyzing HTS data. DVG-profiler might prove valuable not only in basic virus research but also in monitoring live attenuated vaccines for DVG content and to assure vaccine lot to lot consistency.
first_indexed 2024-12-22T03:14:22Z
format Article
id doaj.art-21d18c0f908a4ec790262c8d12cf6b0a
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-22T03:14:22Z
publishDate 2019-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-21d18c0f908a4ec790262c8d12cf6b0a2022-12-21T18:40:51ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-01145e021694410.1371/journal.pone.0216944Identification and quantification of defective virus genomes in high throughput sequencing data using DVG-profiler, a novel post-sequence alignment processing algorithm.Trent J BosmaKonstantinos KaragiannisLuis Santana-QuinteroNatalia IlyushinaTatiana ZagorodnyayaSvetlana PetrovskayaMajid LaassriRaymond P DonnellySteven RubinVahan SimonyanChristian J SauderMost viruses are known to spontaneously generate defective viral genomes (DVG) due to errors during replication. These DVGs are subgenomic and contain deletions that render them unable to complete a full replication cycle in the absence of a co-infecting, non-defective helper virus. DVGs, especially of the copyback type, frequently observed with paramyxoviruses, have been recognized to be important triggers of the antiviral innate immune response. DVGs have therefore gained interest for their potential to alter the attenuation and immunogenicity of vaccines. To investigate this potential, accurate identification and quantification of DVGs is essential. Conventional methods, such as RT-PCR, are labor intensive and will only detect primer sequence-specific species. High throughput sequencing (HTS) is much better suited for this undertaking. Here, we present an HTS-based algorithm called DVG-profiler to identify and quantify all DVG sequences in an HTS data set generated from a virus preparation. DVG-profiler identifies DVG breakpoints relative to a reference genome and reports the directionality of each segment from within the same read. The specificity and sensitivity of the algorithm was assessed using both in silico data sets as well as HTS data obtained from parainfluenza virus 5, Sendai virus and mumps virus preparations. HTS data from the latter were also compared with conventional RT-PCR data and with data obtained using an alternative algorithm. The data presented here demonstrate the high specificity, sensitivity, and robustness of DVG-profiler. This algorithm was implemented within an open source cloud-based computing environment for analyzing HTS data. DVG-profiler might prove valuable not only in basic virus research but also in monitoring live attenuated vaccines for DVG content and to assure vaccine lot to lot consistency.https://doi.org/10.1371/journal.pone.0216944
spellingShingle Trent J Bosma
Konstantinos Karagiannis
Luis Santana-Quintero
Natalia Ilyushina
Tatiana Zagorodnyaya
Svetlana Petrovskaya
Majid Laassri
Raymond P Donnelly
Steven Rubin
Vahan Simonyan
Christian J Sauder
Identification and quantification of defective virus genomes in high throughput sequencing data using DVG-profiler, a novel post-sequence alignment processing algorithm.
PLoS ONE
title Identification and quantification of defective virus genomes in high throughput sequencing data using DVG-profiler, a novel post-sequence alignment processing algorithm.
title_full Identification and quantification of defective virus genomes in high throughput sequencing data using DVG-profiler, a novel post-sequence alignment processing algorithm.
title_fullStr Identification and quantification of defective virus genomes in high throughput sequencing data using DVG-profiler, a novel post-sequence alignment processing algorithm.
title_full_unstemmed Identification and quantification of defective virus genomes in high throughput sequencing data using DVG-profiler, a novel post-sequence alignment processing algorithm.
title_short Identification and quantification of defective virus genomes in high throughput sequencing data using DVG-profiler, a novel post-sequence alignment processing algorithm.
title_sort identification and quantification of defective virus genomes in high throughput sequencing data using dvg profiler a novel post sequence alignment processing algorithm
url https://doi.org/10.1371/journal.pone.0216944
work_keys_str_mv AT trentjbosma identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm
AT konstantinoskaragiannis identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm
AT luissantanaquintero identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm
AT nataliailyushina identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm
AT tatianazagorodnyaya identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm
AT svetlanapetrovskaya identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm
AT majidlaassri identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm
AT raymondpdonnelly identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm
AT stevenrubin identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm
AT vahansimonyan identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm
AT christianjsauder identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm