De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds

The Zika outbreak, spread by the Aedes aegypti mosquito, highlights the need to create high-quality assemblies of large genomes in a rapid and cost-effective way. Here we combine Hi-C data with existing draft assemblies to generate chromosome-length scaffolds. We validate this method by assembling a...

Full description

Bibliographic Details
Main Authors: Dudchenko, Olga, Batra, Sanjit S., Omer, Arina D., Hoeger, Marie, Durand, Neva C., Shamim, Muhammad S., Machol, Ido, Aiden, Aviva Presser, Aiden, Erez Lieberman, Nyquist, Sarah Kate, Lander, Eric Steven
Other Authors: Massachusetts Institute of Technology. Computational and Systems Biology Program
Format: Article
Published: American Association for the Advancement of Science (AAAS) 2018
Online Access:http://hdl.handle.net/1721.1/116720
Description
Summary:The Zika outbreak, spread by the Aedes aegypti mosquito, highlights the need to create high-quality assemblies of large genomes in a rapid and cost-effective way. Here we combine Hi-C data with existing draft assemblies to generate chromosome-length scaffolds. We validate this method by assembling a human genome, de novo, from short reads alone (67× coverage). We then combine our method with draft sequences to create genome assemblies of the mosquito disease vectors Ae. aegypti and Culex quinquefasciatus, each consisting of three scaffolds corresponding to the three chromosomes in each species. These assemblies indicate that almost all genomic rearrangements among these species occur within, rather than between, chromosome arms. The genome assembly procedure we describe is fast, inexpensive, and accurate, and can be applied to many species.