Investigating the Diversity of Tuberculosis Spoligotypes with Dimensionality Reduction and Graph Theory

Bibliographic Details
Main Authors: Gaetan Senelle, Christophe Guyeux, Guislaine Refrégier, Christophe Sola
Format: Article
Language: English
Published: MDPI AG 2022-12-01
Series: Genes
Online Access: https://www.mdpi.com/2073-4425/13/12/2328
Collection: DOAJ
Description: The spoligotype is a graphical description of the CRISPR locus present in Mycobacterium tuberculosis, which is notable for having only 68 possible spacers. The spoligotype, which can easily be obtained either in vitro or in silico, provides summary information about lineage or even antibiotic resistance (when resistance is known to be associated with a particular cluster) at a lower cost. The objective of this article is to show that this representation is richer than it seems and that it has been under-exploited until now. We first recall an original way to represent these spoligotypes as points in the plane, which highlights possible sub-lineages, particularities of the animal strains, etc. This graphical representation shows clusters and a skeleton in the form of a graph, which led us to view these spoligotypes as vertices of an unconnected directed graph. In this paper, we therefore propose to exploit in detail a graph-based description of the variety of spoligotypes, and we show to what extent such a description can be informative.
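Illustration (not part of the catalog record): the idea of treating spoligotypes as vertices of an unconnected directed graph can be sketched as follows. This is a minimal sketch under stated assumptions, not the authors' method. It assumes that a spoligotype is encoded as a string of 1s and 0s (spacer present or absent) and that an edge goes from a parent to a child when the child could arise from the parent by losing one contiguous block of spacers. That edge rule, the function names, and the 8-spacer toy patterns are all hypothetical; real patterns would have 68 positions as stated in the abstract.

```python
# Hypothetical sketch: spoligotypes as vertices of a directed graph.
# Assumption (not from the record): an edge parent -> child exists when the
# child equals the parent after losing one contiguous block of spacers.


def is_single_block_deletion(parent: str, child: str) -> bool:
    """Return True if child can be obtained from parent by one contiguous loss."""
    if len(parent) != len(child) or parent == child:
        return False
    diff = [i for i, (p, c) in enumerate(zip(parent, child)) if p != c]
    # Every difference must be a loss (1 in the parent, 0 in the child).
    if any(parent[i] != "1" or child[i] != "0" for i in diff):
        return False
    # The lost spacers must span a window that is empty in the child.
    lo, hi = diff[0], diff[-1]
    return "1" not in child[lo:hi + 1]


def build_graph(spoligotypes):
    """Map each distinct pattern to the list of patterns it can lead to."""
    nodes = sorted(set(spoligotypes))
    return {p: [c for c in nodes if is_single_block_deletion(p, c)] for p in nodes}


def weak_components(graph):
    """Group vertices that are linked when edge direction is ignored."""
    undirected = {n: set() for n in graph}
    for parent, children in graph.items():
        for child in children:
            undirected[parent].add(child)
            undirected[child].add(parent)
    seen, components = set(), []
    for start in graph:
        if start in seen:
            continue
        stack, comp = [start], set()
        while stack:
            node = stack.pop()
            if node in comp:
                continue
            comp.add(node)
            stack.extend(undirected[node] - comp)
        seen |= comp
        components.append(comp)
    return components


# Toy patterns with 8 spacers for readability (hypothetical data).
toy = ["11111111", "11100111", "11100001", "10100111", "01011010"]
toy_graph = build_graph(toy)
toy_components = weak_components(toy_graph)
print(len(toy_components), "weakly connected components")
```

In this toy set, the last pattern cannot be linked to any other by a single contiguous loss, so it forms its own component, which is what makes the graph unconnected.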
Record ID: doaj.art-d63094735b6745fe91c401fc4d292d52
Institution: Directory Open Access Journal
ISSN: 2073-4425
DOI: 10.3390/genes13122328
Volume: 13
Issue: 12
Article number: 2328
Affiliations:
Gaetan Senelle: FEMTO-ST Institute, UMR 6174 CNRS, Université de Bourgogne-Franche-Comté, Burgundy-Franche-Comte, 90000 Belfort, France
Christophe Guyeux: FEMTO-ST Institute, UMR 6174 CNRS, Université de Bourgogne-Franche-Comté, Burgundy-Franche-Comte, 90000 Belfort, France
Guislaine Refrégier: Université Paris-Saclay, CNRS, AgroParisTech, Ecologie Systématique Evolution, 91405 Gif-sur-Yvette, France
Christophe Sola: Université Paris-Saclay, 91190 Saint Aubin, France
Subjects: Mycobacterium tuberculosis; CRISPR; dimensionality reduction; graph theory