An EST resource for tilapia based on 17 normalized libraries and assembly of 116,899 sequence tags

<p>Abstract</p> <p>Background</p> <p>Large collections of expressed sequence tags (ESTs) are a fundamental resource for analysis of gene expression and annotation of genome sequences. We generated 116,899 ESTs from 17 normalized and two non-normalized cDNA libraries rep...

Full description

Bibliographic Details
Main Authors: Baroiller Jean-Francois, Pepey Elodie, D'Cotta Helena, Conte Matthew A, Howe Aimee E, Lee Bo-Young, di Palma Federica, Carleton Karen L, Kocher Thomas D
Format: Article
Language:English
Published: BMC 2010-04-01
Series:BMC Genomics
Online Access:http://www.biomedcentral.com/1471-2164/11/278
Description
Summary:<p>Abstract</p> <p>Background</p> <p>Large collections of expressed sequence tags (ESTs) are a fundamental resource for analysis of gene expression and annotation of genome sequences. We generated 116,899 ESTs from 17 normalized and two non-normalized cDNA libraries representing 16 tissues from tilapia, a cichlid fish widely used in aquaculture and biological research.</p> <p>Results</p> <p>The ESTs were assembled into 20,190 contigs and 36,028 singletons for a total of 56,218 unique sequences and a total assembled length of 35,168,415 bp. Over the whole project, a unique sequence was discovered for every 2.079 sequence reads. 17,722 (31.5%) of these unique sequences had significant BLAST hits (e-value < 10<sup>-10</sup>) to the UniProt database.</p> <p>Conclusion</p> <p>Normalization of the cDNA pools with double-stranded nuclease allowed us to efficiently sequence a large collection of ESTs. These sequences are an important resource for studies of gene expression, comparative mapping and annotation of the forthcoming tilapia genome sequence.</p>
ISSN:1471-2164