Comparative genomic analysis of 142 bacteriophages infecting Salmonella enterica subsp. enterica
Abstract Background Bacteriophages are bacterial parasites and are considered the most abundant and diverse biological entities on the planet. Previously we identified 154 prophages from 151 serovars of Salmonella enterica subsp. enterica. A detailed analysis of Salmonella prophage genomics is requi...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2020-05-01
|
Series: | BMC Genomics |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s12864-020-6765-z |
_version_ | 1818304685398818816 |
---|---|
author | Ruimin Gao Sohail Naushad Sylvain Moineau Roger Levesque Lawrence Goodridge Dele Ogunremi |
author_facet | Ruimin Gao Sohail Naushad Sylvain Moineau Roger Levesque Lawrence Goodridge Dele Ogunremi |
author_sort | Ruimin Gao |
collection | DOAJ |
description | Abstract Background Bacteriophages are bacterial parasites and are considered the most abundant and diverse biological entities on the planet. Previously we identified 154 prophages from 151 serovars of Salmonella enterica subsp. enterica. A detailed analysis of Salmonella prophage genomics is required given the influence of phages on their bacterial hosts and should provide a broader understanding of Salmonella biology and virulence and contribute to the practical applications of phages as vectors and antibacterial agents. Results Here we provide a comparative analysis of the full genome sequences of 142 prophages of Salmonella enterica subsp. enterica which is the full complement of the prophages that could be retrieved from public databases. We discovered extensive variation in genome sizes (ranging from 6.4 to 358.7 kb) and guanine plus cytosine (GC) content (ranging from 35.5 to 65.4%) and observed a linear correlation between the genome size and the number of open reading frames (ORFs). We used three approaches to compare the phage genomes. The NUCmer/MUMmer genome alignment tool was used to evaluate linkages and correlations based on nucleotide identity between genomes. Multiple sequence alignment was performed to calculate genome average nucleotide identity using the Kalgin program. Finally, genome synteny was explored using dot plot analysis. We found that 90 phage genome sequences grouped into 17 distinct clusters while the remaining 52 genomes showed no close relationships with the other phage genomes and are identified as singletons. We generated genome maps using nucleotide and amino acid sequences which allowed protein-coding genes to be sorted into phamilies (phams) using the Phamerator software. Out of 5796 total assigned phamilies, one phamily was observed to be dominant and was found in 49 prophages, or 34.5% of the 142 phages in our collection. A majority of the phamilies, 4330 out of 5796 (74.7%), occurred in just one prophage underscoring the high degree of diversity among Salmonella bacteriophages. Conclusions Based on nucleotide and amino acid sequences, a high diversity was found among Salmonella bacteriophages which validate the use of prophage sequence analysis as a highly discriminatory subtyping tool for Salmonella. Thorough understanding of the conservation and variation of prophage genomic characteristics will facilitate their rational design and use as tools for bacterial strain construction, vector development and as anti-bacterial agents. |
first_indexed | 2024-12-13T06:14:37Z |
format | Article |
id | doaj.art-8e304237af2a4aedb67d0464bb659616 |
institution | Directory Open Access Journal |
issn | 1471-2164 |
language | English |
last_indexed | 2024-12-13T06:14:37Z |
publishDate | 2020-05-01 |
publisher | BMC |
record_format | Article |
series | BMC Genomics |
spelling | doaj.art-8e304237af2a4aedb67d0464bb6596162022-12-21T23:57:00ZengBMCBMC Genomics1471-21642020-05-0121111310.1186/s12864-020-6765-zComparative genomic analysis of 142 bacteriophages infecting Salmonella enterica subsp. entericaRuimin Gao0Sohail Naushad1Sylvain Moineau2Roger Levesque3Lawrence Goodridge4Dele Ogunremi5Ottawa Laboratory Fallowfield, Canadian Food Inspection AgencyOttawa Laboratory Fallowfield, Canadian Food Inspection AgencyFélix d’Hérelle Reference Center for Bacterial Viruses, Faculté de médecine dentaire, Université LavalInstitut de Biologie Intégrative et des Systèmes, Université LavalPresent Address:Department of Food Science, University of GuelphOttawa Laboratory Fallowfield, Canadian Food Inspection AgencyAbstract Background Bacteriophages are bacterial parasites and are considered the most abundant and diverse biological entities on the planet. Previously we identified 154 prophages from 151 serovars of Salmonella enterica subsp. enterica. A detailed analysis of Salmonella prophage genomics is required given the influence of phages on their bacterial hosts and should provide a broader understanding of Salmonella biology and virulence and contribute to the practical applications of phages as vectors and antibacterial agents. Results Here we provide a comparative analysis of the full genome sequences of 142 prophages of Salmonella enterica subsp. enterica which is the full complement of the prophages that could be retrieved from public databases. We discovered extensive variation in genome sizes (ranging from 6.4 to 358.7 kb) and guanine plus cytosine (GC) content (ranging from 35.5 to 65.4%) and observed a linear correlation between the genome size and the number of open reading frames (ORFs). We used three approaches to compare the phage genomes. The NUCmer/MUMmer genome alignment tool was used to evaluate linkages and correlations based on nucleotide identity between genomes. Multiple sequence alignment was performed to calculate genome average nucleotide identity using the Kalgin program. Finally, genome synteny was explored using dot plot analysis. We found that 90 phage genome sequences grouped into 17 distinct clusters while the remaining 52 genomes showed no close relationships with the other phage genomes and are identified as singletons. We generated genome maps using nucleotide and amino acid sequences which allowed protein-coding genes to be sorted into phamilies (phams) using the Phamerator software. Out of 5796 total assigned phamilies, one phamily was observed to be dominant and was found in 49 prophages, or 34.5% of the 142 phages in our collection. A majority of the phamilies, 4330 out of 5796 (74.7%), occurred in just one prophage underscoring the high degree of diversity among Salmonella bacteriophages. Conclusions Based on nucleotide and amino acid sequences, a high diversity was found among Salmonella bacteriophages which validate the use of prophage sequence analysis as a highly discriminatory subtyping tool for Salmonella. Thorough understanding of the conservation and variation of prophage genomic characteristics will facilitate their rational design and use as tools for bacterial strain construction, vector development and as anti-bacterial agents.http://link.springer.com/article/10.1186/s12864-020-6765-zComparative genomicsBacteriophageNucleotide identitySalmonella entericaPhameratorProphage sequence typing |
spellingShingle | Ruimin Gao Sohail Naushad Sylvain Moineau Roger Levesque Lawrence Goodridge Dele Ogunremi Comparative genomic analysis of 142 bacteriophages infecting Salmonella enterica subsp. enterica BMC Genomics Comparative genomics Bacteriophage Nucleotide identity Salmonella enterica Phamerator Prophage sequence typing |
title | Comparative genomic analysis of 142 bacteriophages infecting Salmonella enterica subsp. enterica |
title_full | Comparative genomic analysis of 142 bacteriophages infecting Salmonella enterica subsp. enterica |
title_fullStr | Comparative genomic analysis of 142 bacteriophages infecting Salmonella enterica subsp. enterica |
title_full_unstemmed | Comparative genomic analysis of 142 bacteriophages infecting Salmonella enterica subsp. enterica |
title_short | Comparative genomic analysis of 142 bacteriophages infecting Salmonella enterica subsp. enterica |
title_sort | comparative genomic analysis of 142 bacteriophages infecting salmonella enterica subsp enterica |
topic | Comparative genomics Bacteriophage Nucleotide identity Salmonella enterica Phamerator Prophage sequence typing |
url | http://link.springer.com/article/10.1186/s12864-020-6765-z |
work_keys_str_mv | AT ruimingao comparativegenomicanalysisof142bacteriophagesinfectingsalmonellaentericasubspenterica AT sohailnaushad comparativegenomicanalysisof142bacteriophagesinfectingsalmonellaentericasubspenterica AT sylvainmoineau comparativegenomicanalysisof142bacteriophagesinfectingsalmonellaentericasubspenterica AT rogerlevesque comparativegenomicanalysisof142bacteriophagesinfectingsalmonellaentericasubspenterica AT lawrencegoodridge comparativegenomicanalysisof142bacteriophagesinfectingsalmonellaentericasubspenterica AT deleogunremi comparativegenomicanalysisof142bacteriophagesinfectingsalmonellaentericasubspenterica |