Identification and quantification of defective virus genomes in high throughput sequencing data using DVG-profiler, a novel post-sequence alignment processing algorithm.
Most viruses are known to spontaneously generate defective viral genomes (DVG) due to errors during replication. These DVGs are subgenomic and contain deletions that render them unable to complete a full replication cycle in the absence of a co-infecting, non-defective helper virus. DVGs, especially...
Main Authors: | , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2019-01-01
|
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0216944 |
_version_ | 1819108716955303936 |
---|---|
author | Trent J Bosma Konstantinos Karagiannis Luis Santana-Quintero Natalia Ilyushina Tatiana Zagorodnyaya Svetlana Petrovskaya Majid Laassri Raymond P Donnelly Steven Rubin Vahan Simonyan Christian J Sauder |
author_facet | Trent J Bosma Konstantinos Karagiannis Luis Santana-Quintero Natalia Ilyushina Tatiana Zagorodnyaya Svetlana Petrovskaya Majid Laassri Raymond P Donnelly Steven Rubin Vahan Simonyan Christian J Sauder |
author_sort | Trent J Bosma |
collection | DOAJ |
description | Most viruses are known to spontaneously generate defective viral genomes (DVG) due to errors during replication. These DVGs are subgenomic and contain deletions that render them unable to complete a full replication cycle in the absence of a co-infecting, non-defective helper virus. DVGs, especially of the copyback type, frequently observed with paramyxoviruses, have been recognized to be important triggers of the antiviral innate immune response. DVGs have therefore gained interest for their potential to alter the attenuation and immunogenicity of vaccines. To investigate this potential, accurate identification and quantification of DVGs is essential. Conventional methods, such as RT-PCR, are labor intensive and will only detect primer sequence-specific species. High throughput sequencing (HTS) is much better suited for this undertaking. Here, we present an HTS-based algorithm called DVG-profiler to identify and quantify all DVG sequences in an HTS data set generated from a virus preparation. DVG-profiler identifies DVG breakpoints relative to a reference genome and reports the directionality of each segment from within the same read. The specificity and sensitivity of the algorithm was assessed using both in silico data sets as well as HTS data obtained from parainfluenza virus 5, Sendai virus and mumps virus preparations. HTS data from the latter were also compared with conventional RT-PCR data and with data obtained using an alternative algorithm. The data presented here demonstrate the high specificity, sensitivity, and robustness of DVG-profiler. This algorithm was implemented within an open source cloud-based computing environment for analyzing HTS data. DVG-profiler might prove valuable not only in basic virus research but also in monitoring live attenuated vaccines for DVG content and to assure vaccine lot to lot consistency. |
first_indexed | 2024-12-22T03:14:22Z |
format | Article |
id | doaj.art-21d18c0f908a4ec790262c8d12cf6b0a |
institution | Directory Open Access Journal |
issn | 1932-6203 |
language | English |
last_indexed | 2024-12-22T03:14:22Z |
publishDate | 2019-01-01 |
publisher | Public Library of Science (PLoS) |
record_format | Article |
series | PLoS ONE |
spelling | doaj.art-21d18c0f908a4ec790262c8d12cf6b0a2022-12-21T18:40:51ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-01145e021694410.1371/journal.pone.0216944Identification and quantification of defective virus genomes in high throughput sequencing data using DVG-profiler, a novel post-sequence alignment processing algorithm.Trent J BosmaKonstantinos KaragiannisLuis Santana-QuinteroNatalia IlyushinaTatiana ZagorodnyayaSvetlana PetrovskayaMajid LaassriRaymond P DonnellySteven RubinVahan SimonyanChristian J SauderMost viruses are known to spontaneously generate defective viral genomes (DVG) due to errors during replication. These DVGs are subgenomic and contain deletions that render them unable to complete a full replication cycle in the absence of a co-infecting, non-defective helper virus. DVGs, especially of the copyback type, frequently observed with paramyxoviruses, have been recognized to be important triggers of the antiviral innate immune response. DVGs have therefore gained interest for their potential to alter the attenuation and immunogenicity of vaccines. To investigate this potential, accurate identification and quantification of DVGs is essential. Conventional methods, such as RT-PCR, are labor intensive and will only detect primer sequence-specific species. High throughput sequencing (HTS) is much better suited for this undertaking. Here, we present an HTS-based algorithm called DVG-profiler to identify and quantify all DVG sequences in an HTS data set generated from a virus preparation. DVG-profiler identifies DVG breakpoints relative to a reference genome and reports the directionality of each segment from within the same read. The specificity and sensitivity of the algorithm was assessed using both in silico data sets as well as HTS data obtained from parainfluenza virus 5, Sendai virus and mumps virus preparations. HTS data from the latter were also compared with conventional RT-PCR data and with data obtained using an alternative algorithm. The data presented here demonstrate the high specificity, sensitivity, and robustness of DVG-profiler. This algorithm was implemented within an open source cloud-based computing environment for analyzing HTS data. DVG-profiler might prove valuable not only in basic virus research but also in monitoring live attenuated vaccines for DVG content and to assure vaccine lot to lot consistency.https://doi.org/10.1371/journal.pone.0216944 |
spellingShingle | Trent J Bosma Konstantinos Karagiannis Luis Santana-Quintero Natalia Ilyushina Tatiana Zagorodnyaya Svetlana Petrovskaya Majid Laassri Raymond P Donnelly Steven Rubin Vahan Simonyan Christian J Sauder Identification and quantification of defective virus genomes in high throughput sequencing data using DVG-profiler, a novel post-sequence alignment processing algorithm. PLoS ONE |
title | Identification and quantification of defective virus genomes in high throughput sequencing data using DVG-profiler, a novel post-sequence alignment processing algorithm. |
title_full | Identification and quantification of defective virus genomes in high throughput sequencing data using DVG-profiler, a novel post-sequence alignment processing algorithm. |
title_fullStr | Identification and quantification of defective virus genomes in high throughput sequencing data using DVG-profiler, a novel post-sequence alignment processing algorithm. |
title_full_unstemmed | Identification and quantification of defective virus genomes in high throughput sequencing data using DVG-profiler, a novel post-sequence alignment processing algorithm. |
title_short | Identification and quantification of defective virus genomes in high throughput sequencing data using DVG-profiler, a novel post-sequence alignment processing algorithm. |
title_sort | identification and quantification of defective virus genomes in high throughput sequencing data using dvg profiler a novel post sequence alignment processing algorithm |
url | https://doi.org/10.1371/journal.pone.0216944 |
work_keys_str_mv | AT trentjbosma identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm AT konstantinoskaragiannis identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm AT luissantanaquintero identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm AT nataliailyushina identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm AT tatianazagorodnyaya identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm AT svetlanapetrovskaya identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm AT majidlaassri identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm AT raymondpdonnelly identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm AT stevenrubin identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm AT vahansimonyan identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm AT christianjsauder identificationandquantificationofdefectivevirusgenomesinhighthroughputsequencingdatausingdvgprofileranovelpostsequencealignmentprocessingalgorithm |