Prediction of SARS-CoV-2 Variant Lineages Using the S1-Encoding Region Sequence Obtained by PacBio Single-Molecule Real-Time Sequencing

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), is the causal agent of the COVID-19 pandemic that emerged in late 2019. The outbreak of variants with mutations in the region encoding the spike protein S1 sub-unit that can make them more resistant to neutralizing or monoclonal antib...

Full description

Bibliographic Details
Main Authors: Sébastien Lhomme, Justine Latour, Nicolas Jeanne, Pauline Trémeaux, Noémie Ranger, Marion Migueres, Gérald Salin, Cécile Donnadieu, Jacques Izopet
Format: Article
Language:English
Published: MDPI AG 2021-12-01
Series:Viruses
Subjects:
Online Access:https://www.mdpi.com/1999-4915/13/12/2544
_version_ 1797499900636692480
author Sébastien Lhomme
Justine Latour
Nicolas Jeanne
Pauline Trémeaux
Noémie Ranger
Marion Migueres
Gérald Salin
Cécile Donnadieu
Jacques Izopet
author_facet Sébastien Lhomme
Justine Latour
Nicolas Jeanne
Pauline Trémeaux
Noémie Ranger
Marion Migueres
Gérald Salin
Cécile Donnadieu
Jacques Izopet
author_sort Sébastien Lhomme
collection DOAJ
description The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), is the causal agent of the COVID-19 pandemic that emerged in late 2019. The outbreak of variants with mutations in the region encoding the spike protein S1 sub-unit that can make them more resistant to neutralizing or monoclonal antibodies is the main point of the current monitoring. This study examines the feasibility of predicting the variant lineage and monitoring the appearance of reported mutations by sequencing only the region encoding the S1 domain by Pacific Bioscience Single Molecule Real-Time sequencing (PacBio SMRT). Using the PacBio SMRT system, we successfully sequenced 186 of the 200 samples previously sequenced with the Illumina COVIDSeq (whole genome) system. PacBio SMRT detected mutations in the S1 domain that were missed by the COVIDseq system in 27/186 samples (14.5%), due to amplification failure. These missing positions included mutations that are decisive for lineage assignation, such as G142D (<i>n</i> = 11), N501Y (<i>n</i> = 6), or E484K (<i>n</i> = 2). The lineage of 172/186 (92.5%) samples was accurately determined by analyzing the region encoding the S1 domain with a pipeline that uses key positions in S1. Thus, the PacBio SMRT protocol is appropriate for determining virus lineages and detecting key mutations.
first_indexed 2024-03-10T03:54:05Z
format Article
id doaj.art-5730af35e40144438f805673a8831718
institution Directory Open Access Journal
issn 1999-4915
language English
last_indexed 2024-03-10T03:54:05Z
publishDate 2021-12-01
publisher MDPI AG
record_format Article
series Viruses
spelling doaj.art-5730af35e40144438f805673a88317182023-11-23T10:59:27ZengMDPI AGViruses1999-49152021-12-011312254410.3390/v13122544Prediction of SARS-CoV-2 Variant Lineages Using the S1-Encoding Region Sequence Obtained by PacBio Single-Molecule Real-Time SequencingSébastien Lhomme0Justine Latour1Nicolas Jeanne2Pauline Trémeaux3Noémie Ranger4Marion Migueres5Gérald Salin6Cécile Donnadieu7Jacques Izopet8Infinity, Université Toulouse, CNRS, INSERM, UPS, 31300 Toulouse, FranceLaboratoire de Virologie, CHU Toulouse, Hôpital Purpan, 31300 Toulouse, FranceLaboratoire de Virologie, CHU Toulouse, Hôpital Purpan, 31300 Toulouse, FranceLaboratoire de Virologie, CHU Toulouse, Hôpital Purpan, 31300 Toulouse, FranceLaboratoire de Virologie, CHU Toulouse, Hôpital Purpan, 31300 Toulouse, FranceInfinity, Université Toulouse, CNRS, INSERM, UPS, 31300 Toulouse, FranceINRAE, US 1426, GeT-PlaGe, Genotoul, 31326 Castanet-Tolosan, FranceINRAE, US 1426, GeT-PlaGe, Genotoul, 31326 Castanet-Tolosan, FranceInfinity, Université Toulouse, CNRS, INSERM, UPS, 31300 Toulouse, FranceThe severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), is the causal agent of the COVID-19 pandemic that emerged in late 2019. The outbreak of variants with mutations in the region encoding the spike protein S1 sub-unit that can make them more resistant to neutralizing or monoclonal antibodies is the main point of the current monitoring. This study examines the feasibility of predicting the variant lineage and monitoring the appearance of reported mutations by sequencing only the region encoding the S1 domain by Pacific Bioscience Single Molecule Real-Time sequencing (PacBio SMRT). Using the PacBio SMRT system, we successfully sequenced 186 of the 200 samples previously sequenced with the Illumina COVIDSeq (whole genome) system. PacBio SMRT detected mutations in the S1 domain that were missed by the COVIDseq system in 27/186 samples (14.5%), due to amplification failure. These missing positions included mutations that are decisive for lineage assignation, such as G142D (<i>n</i> = 11), N501Y (<i>n</i> = 6), or E484K (<i>n</i> = 2). The lineage of 172/186 (92.5%) samples was accurately determined by analyzing the region encoding the S1 domain with a pipeline that uses key positions in S1. Thus, the PacBio SMRT protocol is appropriate for determining virus lineages and detecting key mutations.https://www.mdpi.com/1999-4915/13/12/2544SARS-CoV-2PacBio SMRT sequencingIllumina sequencingS1 domaincladelineage
spellingShingle Sébastien Lhomme
Justine Latour
Nicolas Jeanne
Pauline Trémeaux
Noémie Ranger
Marion Migueres
Gérald Salin
Cécile Donnadieu
Jacques Izopet
Prediction of SARS-CoV-2 Variant Lineages Using the S1-Encoding Region Sequence Obtained by PacBio Single-Molecule Real-Time Sequencing
Viruses
SARS-CoV-2
PacBio SMRT sequencing
Illumina sequencing
S1 domain
clade
lineage
title Prediction of SARS-CoV-2 Variant Lineages Using the S1-Encoding Region Sequence Obtained by PacBio Single-Molecule Real-Time Sequencing
title_full Prediction of SARS-CoV-2 Variant Lineages Using the S1-Encoding Region Sequence Obtained by PacBio Single-Molecule Real-Time Sequencing
title_fullStr Prediction of SARS-CoV-2 Variant Lineages Using the S1-Encoding Region Sequence Obtained by PacBio Single-Molecule Real-Time Sequencing
title_full_unstemmed Prediction of SARS-CoV-2 Variant Lineages Using the S1-Encoding Region Sequence Obtained by PacBio Single-Molecule Real-Time Sequencing
title_short Prediction of SARS-CoV-2 Variant Lineages Using the S1-Encoding Region Sequence Obtained by PacBio Single-Molecule Real-Time Sequencing
title_sort prediction of sars cov 2 variant lineages using the s1 encoding region sequence obtained by pacbio single molecule real time sequencing
topic SARS-CoV-2
PacBio SMRT sequencing
Illumina sequencing
S1 domain
clade
lineage
url https://www.mdpi.com/1999-4915/13/12/2544
work_keys_str_mv AT sebastienlhomme predictionofsarscov2variantlineagesusingthes1encodingregionsequenceobtainedbypacbiosinglemoleculerealtimesequencing
AT justinelatour predictionofsarscov2variantlineagesusingthes1encodingregionsequenceobtainedbypacbiosinglemoleculerealtimesequencing
AT nicolasjeanne predictionofsarscov2variantlineagesusingthes1encodingregionsequenceobtainedbypacbiosinglemoleculerealtimesequencing
AT paulinetremeaux predictionofsarscov2variantlineagesusingthes1encodingregionsequenceobtainedbypacbiosinglemoleculerealtimesequencing
AT noemieranger predictionofsarscov2variantlineagesusingthes1encodingregionsequenceobtainedbypacbiosinglemoleculerealtimesequencing
AT marionmigueres predictionofsarscov2variantlineagesusingthes1encodingregionsequenceobtainedbypacbiosinglemoleculerealtimesequencing
AT geraldsalin predictionofsarscov2variantlineagesusingthes1encodingregionsequenceobtainedbypacbiosinglemoleculerealtimesequencing
AT ceciledonnadieu predictionofsarscov2variantlineagesusingthes1encodingregionsequenceobtainedbypacbiosinglemoleculerealtimesequencing
AT jacquesizopet predictionofsarscov2variantlineagesusingthes1encodingregionsequenceobtainedbypacbiosinglemoleculerealtimesequencing