Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes

Illumina sequencing allows rapid, cheap and accurate whole genome bacterial analyses, but short reads (<300 bp) do not usually enable complete genome assembly. Long-read sequencing greatly assists with resolving complex bacterial genomes, particularly when combined with short-read Illumina data (...

Descripció completa

Dades bibliogràfiques
Autors principals: De Maio, N, Shaw, L, Hubbard, A, George, S, Sanderson, N, Swann, J, Wick, R, Abuoun, M, Stubberfield, E, Hoosdally, S, Crook, D, Peto, T, Sheppard, A, Bailey, M, Read, D, Anjum, M, Walker, A, Stoesser, N
Format: Journal article
Idioma:English
Publicat: Microbiology Society 2019
_version_ 1826258185447538688
author De Maio, N
Shaw, L
Hubbard, A
George, S
Sanderson, N
Swann, J
Wick, R
Abuoun, M
Stubberfield, E
Hoosdally, S
Crook, D
Peto, T
Sheppard, A
Bailey, M
Read, D
Anjum, M
Walker, A
Stoesser, N
author_facet De Maio, N
Shaw, L
Hubbard, A
George, S
Sanderson, N
Swann, J
Wick, R
Abuoun, M
Stubberfield, E
Hoosdally, S
Crook, D
Peto, T
Sheppard, A
Bailey, M
Read, D
Anjum, M
Walker, A
Stoesser, N
author_sort De Maio, N
collection OXFORD
description Illumina sequencing allows rapid, cheap and accurate whole genome bacterial analyses, but short reads (<300 bp) do not usually enable complete genome assembly. Long-read sequencing greatly assists with resolving complex bacterial genomes, particularly when combined with short-read Illumina data (hybrid assembly). However, it is not clear how different long-read sequencing methods affect hybrid assembly accuracy. Relative automation of the assembly process is also crucial to facilitating high-throughput complete bacterial genome reconstruction, avoiding multiple bespoke filtering and data manipulation steps. In this study, we compared hybrid assemblies for 20 bacterial isolates, including two reference strains, using Illumina sequencing and long reads from either Oxford Nanopore Technologies (ONT) or SMRT Pacific Biosciences (PacBio) sequencing platforms. We chose isolates from the family Enterobacteriaceae, as these frequently have highly plastic, repetitive genetic structures, and complete genome reconstruction for these species is relevant for a precise understanding of the epidemiology of antimicrobial resistance. We de novo assembled genomes using the hybrid assembler Unicycler and compared different read processing strategies, as well as comparing to long-read-only assembly with Flye followed by short-read polishing with Pilon. Hybrid assembly with either PacBio or ONT reads facilitated high-quality genome reconstruction, and was superior to the long-read assembly and polishing approach evaluated with respect to accuracy and completeness. Combining ONT and Illumina reads fully resolved most genomes without additional manual steps, and at a lower consumables cost per isolate in our setting. Automated hybrid assembly is a powerful tool for complete and accurate bacterial genome assembly.
first_indexed 2024-03-06T18:30:00Z
format Journal article
id oxford-uuid:094dd989-11e4-41e0-836f-8771ab908327
institution University of Oxford
language English
last_indexed 2024-03-06T18:30:00Z
publishDate 2019
publisher Microbiology Society
record_format dspace
spelling oxford-uuid:094dd989-11e4-41e0-836f-8771ab9083272022-03-26T09:17:39ZComparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomesJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:094dd989-11e4-41e0-836f-8771ab908327EnglishSymplectic Elements at OxfordMicrobiology Society2019De Maio, NShaw, LHubbard, AGeorge, SSanderson, NSwann, JWick, RAbuoun, MStubberfield, EHoosdally, SCrook, DPeto, TSheppard, ABailey, MRead, DAnjum, MWalker, AStoesser, NIllumina sequencing allows rapid, cheap and accurate whole genome bacterial analyses, but short reads (<300 bp) do not usually enable complete genome assembly. Long-read sequencing greatly assists with resolving complex bacterial genomes, particularly when combined with short-read Illumina data (hybrid assembly). However, it is not clear how different long-read sequencing methods affect hybrid assembly accuracy. Relative automation of the assembly process is also crucial to facilitating high-throughput complete bacterial genome reconstruction, avoiding multiple bespoke filtering and data manipulation steps. In this study, we compared hybrid assemblies for 20 bacterial isolates, including two reference strains, using Illumina sequencing and long reads from either Oxford Nanopore Technologies (ONT) or SMRT Pacific Biosciences (PacBio) sequencing platforms. We chose isolates from the family Enterobacteriaceae, as these frequently have highly plastic, repetitive genetic structures, and complete genome reconstruction for these species is relevant for a precise understanding of the epidemiology of antimicrobial resistance. We de novo assembled genomes using the hybrid assembler Unicycler and compared different read processing strategies, as well as comparing to long-read-only assembly with Flye followed by short-read polishing with Pilon. Hybrid assembly with either PacBio or ONT reads facilitated high-quality genome reconstruction, and was superior to the long-read assembly and polishing approach evaluated with respect to accuracy and completeness. Combining ONT and Illumina reads fully resolved most genomes without additional manual steps, and at a lower consumables cost per isolate in our setting. Automated hybrid assembly is a powerful tool for complete and accurate bacterial genome assembly.
spellingShingle De Maio, N
Shaw, L
Hubbard, A
George, S
Sanderson, N
Swann, J
Wick, R
Abuoun, M
Stubberfield, E
Hoosdally, S
Crook, D
Peto, T
Sheppard, A
Bailey, M
Read, D
Anjum, M
Walker, A
Stoesser, N
Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes
title Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes
title_full Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes
title_fullStr Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes
title_full_unstemmed Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes
title_short Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes
title_sort comparison of long read sequencing technologies in the hybrid assembly of complex bacterial genomes
work_keys_str_mv AT demaion comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT shawl comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT hubbarda comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT georges comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT sandersonn comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT swannj comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT wickr comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT abuounm comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT stubberfielde comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT hoosdallys comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT crookd comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT petot comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT shepparda comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT baileym comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT readd comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT anjumm comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT walkera comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes
AT stoessern comparisonoflongreadsequencingtechnologiesinthehybridassemblyofcomplexbacterialgenomes