Mapping gene-by-gene single-nucleotide variation in 8,535 Mycobacterium tuberculosis genomes: a resource to support potential vaccine and drug development

<p>Tuberculosis (TB) is responsible for millions of deaths annually. More effective vaccines and new antituberculous drugs are essential to control the disease. Numerous genomic studies have advanced our knowledge about <i>M. tuberculosis</i> drug resistance, population structure,...

Full description

Bibliographic Details
Main Authors: Papakonstantinou, D, Dunn, SJ, Draper, SJ, Cunningham, AF, O'Shea, MK, McNally, A
Format: Journal article
Language:English
Published: American Society for Microbiology 2021
Description
Summary:<p>Tuberculosis (TB) is responsible for millions of deaths annually. More effective vaccines and new antituberculous drugs are essential to control the disease. Numerous genomic studies have advanced our knowledge about <i>M. tuberculosis</i> drug resistance, population structure, and transmission patterns. At the same time, reverse vaccinology and drug discovery pipelines have identified potential immunogenic vaccine candidates or drug targets. However, a better understanding of the sequence variation of all the <i>M. tuberculosis</i> genes on a large scale could aid in the identification of new vaccine and drug targets. Achieving this was the focus of the current study. Genome sequence data were obtained from online public sources covering seven <i>M. tuberculosis</i> lineages. A total of 8,535 genome sequences were mapped against <i>M. tuberculosis</i> H37Rv reference genome, in order to identify single nucleotide polymorphisms (SNPs). The results of the initial mapping were further processed, and a frequency distribution of nucleotide variants within genes was identified and further analyzed. The majority of genomic positions in the <i>M. tuberculosis</i> H37Rv genome were conserved. Genes with the highest level of conservation were often associated with stress responses and maintenance of redox balance. Conversely, genes with high levels of nucleotide variation were often associated with drug resistance. We have provided a high-resolution analysis of the single-nucleotide variation of all <i>M. tuberculosis</i> genes across seven lineages as a resource to support future drug and vaccine development. We have identified a number of highly conserved genes, important in <i>M. tuberculosis</i> biology, that could potentially be used as targets for novel vaccine candidates and antituberculous medications.</p> <p><strong>IMPORTANCE</strong> Tuberculosis is an infectious disease caused by the bacterium <i>Mycobacterium tuberculosis</i>. In the first half of the 20th century, the discovery of the <i>Mycobacterium bovis</i> BCG vaccine and antituberculous drugs heralded a new era in the control of TB. However, combating TB has proven challenging, especially with the emergence of HIV and drug resistance. A major hindrance in TB control is the lack of an effective vaccine, as the efficacy of BCG is geographically variable and provides little protection against pulmonary disease in high-risk groups. Our research is significant because it provides a resource to support future drug and vaccine development. We have achieved this by developing a better understanding of the nucleotide variation of all of the <i>M. tuberculosis</i> genes on a large scale and by identifying highly conserved genes that could potentially be used as targets for novel vaccine candidates and antituberculous medications.</p>