Systematic phylogenetic analysis of influenza A virus reveals many novel mosaic genome segments

Recombination plays an important role in shaping the genetic diversity of a number of DNA and RNA viruses. Although some recent studies have reported bioinformatic evidence of mosaic sequences in a variety of influenza A viruses, it remains controversial as to whether these represent bona fide natur...

Full description

Bibliographic Details
Main Authors: Lam, T, Chong, Y, Shi, M, Hon, C, Li, J, Martin, D, Tang, J, Mok, C, Shih, S, Yip, C, Jiang, J, Hui, R, Pybus, O, Holmes, E, Leung, F
Format: Journal article
Language:English
Published: 2013
_version_ 1826260725343977472
author Lam, T
Chong, Y
Shi, M
Hon, C
Li, J
Martin, D
Tang, J
Mok, C
Shih, S
Yip, C
Jiang, J
Hui, R
Pybus, O
Holmes, E
Leung, F
author_facet Lam, T
Chong, Y
Shi, M
Hon, C
Li, J
Martin, D
Tang, J
Mok, C
Shih, S
Yip, C
Jiang, J
Hui, R
Pybus, O
Holmes, E
Leung, F
author_sort Lam, T
collection OXFORD
description Recombination plays an important role in shaping the genetic diversity of a number of DNA and RNA viruses. Although some recent studies have reported bioinformatic evidence of mosaic sequences in a variety of influenza A viruses, it remains controversial as to whether these represent bona fide natural recombination events or laboratory artifacts. Importantly, mosaic genome structures can create significant topological incongruence during phylogenetic analyses, which can mislead additional phylogeny-based molecular evolutionary analyses such as molecular clock dating, the detection of selection pressures and phylogeographic inference. As a result, there is a strong need for systematic screenings for mosaic structures within the influenza virus genome database. We used a combination of sequence-based and phylogeny-based methods to identify 388 mosaic influenza genomic segments, of which 332 are previously unreported and are significantly supported by phylogenetic methods. It is impossible, however, to ascertain whether these represent natural recombinants. To facilitate the future identification of recombinants, reference sets of non-recombinant sequences were selected for use in an automatic screening protocol for detecting mosaic sequences. Tests using real and simulated mosaic sequences indicate that our screening protocol is both sensitive (average >90%) and accurate (average >77%) enough to identify a range of different mosaic patterns. The relatively high prevalence of mosaic influenza virus sequences implies that efficient systematic screens, such as that proposed here, should be performed routinely to detect natural recombinant strains, potential laboratory artifacts, and sequencing contaminants either prior to sequences being deposited in GenBank or before they are used for phylogenetic analyses. © 2013 Elsevier B.V.
first_indexed 2024-03-06T19:10:15Z
format Journal article
id oxford-uuid:1688e870-789a-4d77-8cb5-d8a2c66fd83c
institution University of Oxford
language English
last_indexed 2024-03-06T19:10:15Z
publishDate 2013
record_format dspace
spelling oxford-uuid:1688e870-789a-4d77-8cb5-d8a2c66fd83c2022-03-26T10:31:51ZSystematic phylogenetic analysis of influenza A virus reveals many novel mosaic genome segmentsJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:1688e870-789a-4d77-8cb5-d8a2c66fd83cEnglishSymplectic Elements at Oxford2013Lam, TChong, YShi, MHon, CLi, JMartin, DTang, JMok, CShih, SYip, CJiang, JHui, RPybus, OHolmes, ELeung, FRecombination plays an important role in shaping the genetic diversity of a number of DNA and RNA viruses. Although some recent studies have reported bioinformatic evidence of mosaic sequences in a variety of influenza A viruses, it remains controversial as to whether these represent bona fide natural recombination events or laboratory artifacts. Importantly, mosaic genome structures can create significant topological incongruence during phylogenetic analyses, which can mislead additional phylogeny-based molecular evolutionary analyses such as molecular clock dating, the detection of selection pressures and phylogeographic inference. As a result, there is a strong need for systematic screenings for mosaic structures within the influenza virus genome database. We used a combination of sequence-based and phylogeny-based methods to identify 388 mosaic influenza genomic segments, of which 332 are previously unreported and are significantly supported by phylogenetic methods. It is impossible, however, to ascertain whether these represent natural recombinants. To facilitate the future identification of recombinants, reference sets of non-recombinant sequences were selected for use in an automatic screening protocol for detecting mosaic sequences. Tests using real and simulated mosaic sequences indicate that our screening protocol is both sensitive (average >90%) and accurate (average >77%) enough to identify a range of different mosaic patterns. The relatively high prevalence of mosaic influenza virus sequences implies that efficient systematic screens, such as that proposed here, should be performed routinely to detect natural recombinant strains, potential laboratory artifacts, and sequencing contaminants either prior to sequences being deposited in GenBank or before they are used for phylogenetic analyses. © 2013 Elsevier B.V.
spellingShingle Lam, T
Chong, Y
Shi, M
Hon, C
Li, J
Martin, D
Tang, J
Mok, C
Shih, S
Yip, C
Jiang, J
Hui, R
Pybus, O
Holmes, E
Leung, F
Systematic phylogenetic analysis of influenza A virus reveals many novel mosaic genome segments
title Systematic phylogenetic analysis of influenza A virus reveals many novel mosaic genome segments
title_full Systematic phylogenetic analysis of influenza A virus reveals many novel mosaic genome segments
title_fullStr Systematic phylogenetic analysis of influenza A virus reveals many novel mosaic genome segments
title_full_unstemmed Systematic phylogenetic analysis of influenza A virus reveals many novel mosaic genome segments
title_short Systematic phylogenetic analysis of influenza A virus reveals many novel mosaic genome segments
title_sort systematic phylogenetic analysis of influenza a virus reveals many novel mosaic genome segments
work_keys_str_mv AT lamt systematicphylogeneticanalysisofinfluenzaavirusrevealsmanynovelmosaicgenomesegments
AT chongy systematicphylogeneticanalysisofinfluenzaavirusrevealsmanynovelmosaicgenomesegments
AT shim systematicphylogeneticanalysisofinfluenzaavirusrevealsmanynovelmosaicgenomesegments
AT honc systematicphylogeneticanalysisofinfluenzaavirusrevealsmanynovelmosaicgenomesegments
AT lij systematicphylogeneticanalysisofinfluenzaavirusrevealsmanynovelmosaicgenomesegments
AT martind systematicphylogeneticanalysisofinfluenzaavirusrevealsmanynovelmosaicgenomesegments
AT tangj systematicphylogeneticanalysisofinfluenzaavirusrevealsmanynovelmosaicgenomesegments
AT mokc systematicphylogeneticanalysisofinfluenzaavirusrevealsmanynovelmosaicgenomesegments
AT shihs systematicphylogeneticanalysisofinfluenzaavirusrevealsmanynovelmosaicgenomesegments
AT yipc systematicphylogeneticanalysisofinfluenzaavirusrevealsmanynovelmosaicgenomesegments
AT jiangj systematicphylogeneticanalysisofinfluenzaavirusrevealsmanynovelmosaicgenomesegments
AT huir systematicphylogeneticanalysisofinfluenzaavirusrevealsmanynovelmosaicgenomesegments
AT pybuso systematicphylogeneticanalysisofinfluenzaavirusrevealsmanynovelmosaicgenomesegments
AT holmese systematicphylogeneticanalysisofinfluenzaavirusrevealsmanynovelmosaicgenomesegments
AT leungf systematicphylogeneticanalysisofinfluenzaavirusrevealsmanynovelmosaicgenomesegments