Early detection of emerging SARS-CoV-2 variants of interest for experimental evaluation
Since the beginning of the COVID-19 pandemic, SARS-CoV-2 has demonstrated its ability to rapidly and continuously evolve, leading to the emergence of thousands of different sequence variants, many with distinctive phenotypic properties. Fortunately, the broad application of next generation sequencin...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2022-10-01
|
Series: | Frontiers in Bioinformatics |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/fbinf.2022.1020189/full |
_version_ | 1811235643430273024 |
---|---|
author | Zachary S. Wallace Zachary S. Wallace James Davis James Davis Anna Maria Niewiadomska Robert D. Olson Robert D. Olson Maulik Shukla Maulik Shukla Rick Stevens Rick Stevens Yun Zhang Christian M. Zmasek Richard H. Scheuermann Richard H. Scheuermann Richard H. Scheuermann Richard H. Scheuermann |
author_facet | Zachary S. Wallace Zachary S. Wallace James Davis James Davis Anna Maria Niewiadomska Robert D. Olson Robert D. Olson Maulik Shukla Maulik Shukla Rick Stevens Rick Stevens Yun Zhang Christian M. Zmasek Richard H. Scheuermann Richard H. Scheuermann Richard H. Scheuermann Richard H. Scheuermann |
author_sort | Zachary S. Wallace |
collection | DOAJ |
description | Since the beginning of the COVID-19 pandemic, SARS-CoV-2 has demonstrated its ability to rapidly and continuously evolve, leading to the emergence of thousands of different sequence variants, many with distinctive phenotypic properties. Fortunately, the broad application of next generation sequencing (NGS) across the globe has produced a wealth of SARS-CoV-2 genome sequences, offering a comprehensive picture of how this virus is evolving so that accurate diagnostics, reliable therapeutics, and prophylactic vaccines against COVID-19 can be developed and maintained. The millions of SARS-CoV-2 sequences deposited into genomic sequencing databases, including GenBank, BV-BRC, and GISAID, are annotated with the dates and geographic locations of sample collection, and can be aligned to and compared with the Wuhan-Hu-1 reference genome to extract their constellation of nucleotide and amino acid substitutions. By aggregating these data into concise datasets, the spread of variants through space and time can be assessed. Variant tracking efforts have initially focused on the Spike protein due to its critical role in viral tropism and antibody neutralization. To identify emerging variants of concern as early as possible, we developed a computational pipeline to process the genomic data and assign risk scores based on both epidemiological and functional parameters. Epidemiological dynamics are used to identify variants exhibiting substantial growth over time and spread across geographical regions. Experimental data that quantify Spike protein regions targeted by adaptive immunity and critical for other virus characteristics are used to predict variants with consequential immunogenic and pathogenic impacts. The growth assessment and functional impact scores are combined to produce a Composite Score for any set of Spike substitutions detected. With this systematic method to routinely score and rank emerging variants, we have established an approach to identify threatening variants early and prioritize them for experimental evaluation. |
first_indexed | 2024-04-12T11:55:27Z |
format | Article |
id | doaj.art-fbea6d946a9a470e922b5d9464c8e896 |
institution | Directory Open Access Journal |
issn | 2673-7647 |
language | English |
last_indexed | 2024-04-12T11:55:27Z |
publishDate | 2022-10-01 |
publisher | Frontiers Media S.A. |
record_format | Article |
series | Frontiers in Bioinformatics |
spelling | doaj.art-fbea6d946a9a470e922b5d9464c8e8962022-12-22T03:34:02ZengFrontiers Media S.A.Frontiers in Bioinformatics2673-76472022-10-01210.3389/fbinf.2022.10201891020189Early detection of emerging SARS-CoV-2 variants of interest for experimental evaluationZachary S. Wallace0Zachary S. Wallace1James Davis2James Davis3Anna Maria Niewiadomska4Robert D. Olson5Robert D. Olson6Maulik Shukla7Maulik Shukla8Rick Stevens9Rick Stevens10Yun Zhang11Christian M. Zmasek12Richard H. Scheuermann13Richard H. Scheuermann14Richard H. Scheuermann15Richard H. Scheuermann16Department of Informatics, J. Craig Venter Institute, La Jolla, CA, United StatesDepartment of Computer Science and Engineering, University of California, San Diego, San Diego, CA, United StatesDivision of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United StatesUniversity of Chicago Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, United StatesDepartment of Informatics, J. Craig Venter Institute, La Jolla, CA, United StatesDivision of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United StatesUniversity of Chicago Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, United StatesDivision of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United StatesUniversity of Chicago Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, United StatesComputing Environment and Life Sciences, Argonne National Laboratory, Argonne, IL, United StatesDepartment of Computer Science, University of Chicago, Chicago, IL, United StatesDepartment of Informatics, J. Craig Venter Institute, La Jolla, CA, United StatesDepartment of Informatics, J. Craig Venter Institute, La Jolla, CA, United StatesDepartment of Informatics, J. Craig Venter Institute, La Jolla, CA, United StatesDepartment of Pathology, University of California, San Diego, San Diego, CA, United StatesDivision of Vaccine Discovery, La Jolla Institute for Immunology, La Jolla, CA, United StatesGlobal Virus Network, Baltimore, MD, United StatesSince the beginning of the COVID-19 pandemic, SARS-CoV-2 has demonstrated its ability to rapidly and continuously evolve, leading to the emergence of thousands of different sequence variants, many with distinctive phenotypic properties. Fortunately, the broad application of next generation sequencing (NGS) across the globe has produced a wealth of SARS-CoV-2 genome sequences, offering a comprehensive picture of how this virus is evolving so that accurate diagnostics, reliable therapeutics, and prophylactic vaccines against COVID-19 can be developed and maintained. The millions of SARS-CoV-2 sequences deposited into genomic sequencing databases, including GenBank, BV-BRC, and GISAID, are annotated with the dates and geographic locations of sample collection, and can be aligned to and compared with the Wuhan-Hu-1 reference genome to extract their constellation of nucleotide and amino acid substitutions. By aggregating these data into concise datasets, the spread of variants through space and time can be assessed. Variant tracking efforts have initially focused on the Spike protein due to its critical role in viral tropism and antibody neutralization. To identify emerging variants of concern as early as possible, we developed a computational pipeline to process the genomic data and assign risk scores based on both epidemiological and functional parameters. Epidemiological dynamics are used to identify variants exhibiting substantial growth over time and spread across geographical regions. Experimental data that quantify Spike protein regions targeted by adaptive immunity and critical for other virus characteristics are used to predict variants with consequential immunogenic and pathogenic impacts. The growth assessment and functional impact scores are combined to produce a Composite Score for any set of Spike substitutions detected. With this systematic method to routinely score and rank emerging variants, we have established an approach to identify threatening variants early and prioritize them for experimental evaluation.https://www.frontiersin.org/articles/10.3389/fbinf.2022.1020189/fullSARS-CoV-2variants of concernomicrondeltaviral evolutiongenomic surveillance |
spellingShingle | Zachary S. Wallace Zachary S. Wallace James Davis James Davis Anna Maria Niewiadomska Robert D. Olson Robert D. Olson Maulik Shukla Maulik Shukla Rick Stevens Rick Stevens Yun Zhang Christian M. Zmasek Richard H. Scheuermann Richard H. Scheuermann Richard H. Scheuermann Richard H. Scheuermann Early detection of emerging SARS-CoV-2 variants of interest for experimental evaluation Frontiers in Bioinformatics SARS-CoV-2 variants of concern omicron delta viral evolution genomic surveillance |
title | Early detection of emerging SARS-CoV-2 variants of interest for experimental evaluation |
title_full | Early detection of emerging SARS-CoV-2 variants of interest for experimental evaluation |
title_fullStr | Early detection of emerging SARS-CoV-2 variants of interest for experimental evaluation |
title_full_unstemmed | Early detection of emerging SARS-CoV-2 variants of interest for experimental evaluation |
title_short | Early detection of emerging SARS-CoV-2 variants of interest for experimental evaluation |
title_sort | early detection of emerging sars cov 2 variants of interest for experimental evaluation |
topic | SARS-CoV-2 variants of concern omicron delta viral evolution genomic surveillance |
url | https://www.frontiersin.org/articles/10.3389/fbinf.2022.1020189/full |
work_keys_str_mv | AT zacharyswallace earlydetectionofemergingsarscov2variantsofinterestforexperimentalevaluation AT zacharyswallace earlydetectionofemergingsarscov2variantsofinterestforexperimentalevaluation AT jamesdavis earlydetectionofemergingsarscov2variantsofinterestforexperimentalevaluation AT jamesdavis earlydetectionofemergingsarscov2variantsofinterestforexperimentalevaluation AT annamarianiewiadomska earlydetectionofemergingsarscov2variantsofinterestforexperimentalevaluation AT robertdolson earlydetectionofemergingsarscov2variantsofinterestforexperimentalevaluation AT robertdolson earlydetectionofemergingsarscov2variantsofinterestforexperimentalevaluation AT maulikshukla earlydetectionofemergingsarscov2variantsofinterestforexperimentalevaluation AT maulikshukla earlydetectionofemergingsarscov2variantsofinterestforexperimentalevaluation AT rickstevens earlydetectionofemergingsarscov2variantsofinterestforexperimentalevaluation AT rickstevens earlydetectionofemergingsarscov2variantsofinterestforexperimentalevaluation AT yunzhang earlydetectionofemergingsarscov2variantsofinterestforexperimentalevaluation AT christianmzmasek earlydetectionofemergingsarscov2variantsofinterestforexperimentalevaluation AT richardhscheuermann earlydetectionofemergingsarscov2variantsofinterestforexperimentalevaluation AT richardhscheuermann earlydetectionofemergingsarscov2variantsofinterestforexperimentalevaluation AT richardhscheuermann earlydetectionofemergingsarscov2variantsofinterestforexperimentalevaluation AT richardhscheuermann earlydetectionofemergingsarscov2variantsofinterestforexperimentalevaluation |