iAssembler: a package for <it>de novo </it>assembly of Roche-454/Sanger transcriptome sequences

<p>Abstract</p> <p>Background</p> <p>Expressed Sequence Tags (ESTs) have played significant roles in gene discovery and gene functional analysis, especially for non-model organisms. For organisms with no full genome sequences available, ESTs are normally assembled into...

Full description

Bibliographic Details
Main Authors: Zheng Yi, Zhao Liangjun, Gao Junping, Fei Zhangjun
Format: Article
Language:English
Published: BMC 2011-11-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/12/453
_version_ 1818208612378476544
author Zheng Yi
Zhao Liangjun
Gao Junping
Fei Zhangjun
author_facet Zheng Yi
Zhao Liangjun
Gao Junping
Fei Zhangjun
author_sort Zheng Yi
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>Expressed Sequence Tags (ESTs) have played significant roles in gene discovery and gene functional analysis, especially for non-model organisms. For organisms with no full genome sequences available, ESTs are normally assembled into longer consensus sequences for further downstream analysis. However current <it>de novo </it>EST assembly programs often generate large number of assembly errors that will negatively affect the downstream analysis. In order to generate more accurate consensus sequences from ESTs, tools are needed to reduce or eliminate errors from <it>de novo </it>assemblies.</p> <p>Results</p> <p>We present iAssembler, a pipeline that can assemble large-scale ESTs into consensus sequences with significantly higher accuracy than current existing assemblers. iAssembler employs MIRA and CAP3 assemblers to generate initial assemblies, followed by identifying and correcting two common types of transcriptome assembly errors: 1) ESTs from different transcripts (mainly alternatively spliced transcripts or paralogs) are incorrectly assembled into same contigs; and 2) ESTs from same transcripts fail to be assembled together. iAssembler can be used to assemble ESTs generated using the traditional Sanger method and/or the Roche-454 massive parallel pyrosequencing technology.</p> <p>Conclusion</p> <p>We compared performances of iAssembler and several other <it>de novo </it>EST assembly programs using both Roche-454 and Sanger EST datasets. It demonstrated that iAssembler generated significantly more accurate consensus sequences than other assembly programs.</p>
first_indexed 2024-12-12T04:47:35Z
format Article
id doaj.art-459ada77a9924d2eb5b44745cdb9535e
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-12-12T04:47:35Z
publishDate 2011-11-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-459ada77a9924d2eb5b44745cdb9535e2022-12-22T00:37:34ZengBMCBMC Bioinformatics1471-21052011-11-0112145310.1186/1471-2105-12-453iAssembler: a package for <it>de novo </it>assembly of Roche-454/Sanger transcriptome sequencesZheng YiZhao LiangjunGao JunpingFei Zhangjun<p>Abstract</p> <p>Background</p> <p>Expressed Sequence Tags (ESTs) have played significant roles in gene discovery and gene functional analysis, especially for non-model organisms. For organisms with no full genome sequences available, ESTs are normally assembled into longer consensus sequences for further downstream analysis. However current <it>de novo </it>EST assembly programs often generate large number of assembly errors that will negatively affect the downstream analysis. In order to generate more accurate consensus sequences from ESTs, tools are needed to reduce or eliminate errors from <it>de novo </it>assemblies.</p> <p>Results</p> <p>We present iAssembler, a pipeline that can assemble large-scale ESTs into consensus sequences with significantly higher accuracy than current existing assemblers. iAssembler employs MIRA and CAP3 assemblers to generate initial assemblies, followed by identifying and correcting two common types of transcriptome assembly errors: 1) ESTs from different transcripts (mainly alternatively spliced transcripts or paralogs) are incorrectly assembled into same contigs; and 2) ESTs from same transcripts fail to be assembled together. iAssembler can be used to assemble ESTs generated using the traditional Sanger method and/or the Roche-454 massive parallel pyrosequencing technology.</p> <p>Conclusion</p> <p>We compared performances of iAssembler and several other <it>de novo </it>EST assembly programs using both Roche-454 and Sanger EST datasets. It demonstrated that iAssembler generated significantly more accurate consensus sequences than other assembly programs.</p>http://www.biomedcentral.com/1471-2105/12/453
spellingShingle Zheng Yi
Zhao Liangjun
Gao Junping
Fei Zhangjun
iAssembler: a package for <it>de novo </it>assembly of Roche-454/Sanger transcriptome sequences
BMC Bioinformatics
title iAssembler: a package for <it>de novo </it>assembly of Roche-454/Sanger transcriptome sequences
title_full iAssembler: a package for <it>de novo </it>assembly of Roche-454/Sanger transcriptome sequences
title_fullStr iAssembler: a package for <it>de novo </it>assembly of Roche-454/Sanger transcriptome sequences
title_full_unstemmed iAssembler: a package for <it>de novo </it>assembly of Roche-454/Sanger transcriptome sequences
title_short iAssembler: a package for <it>de novo </it>assembly of Roche-454/Sanger transcriptome sequences
title_sort iassembler a package for it de novo it assembly of roche 454 sanger transcriptome sequences
url http://www.biomedcentral.com/1471-2105/12/453
work_keys_str_mv AT zhengyi iassemblerapackageforitdenovoitassemblyofroche454sangertranscriptomesequences
AT zhaoliangjun iassemblerapackageforitdenovoitassemblyofroche454sangertranscriptomesequences
AT gaojunping iassemblerapackageforitdenovoitassemblyofroche454sangertranscriptomesequences
AT feizhangjun iassemblerapackageforitdenovoitassemblyofroche454sangertranscriptomesequences