SRAdb: query and use public next-generation sequencing data from within R

Abstract. Background: The Sequence Read Archive (SRA) is the largest public repository of sequencing data from the next generation of sequencing platforms including Illumina (Genome Analyzer, HiSeq, MiSeq, etc.), Roche 454 GS System, Applied Biosystems SO...

Bibliographic Details
Main Authors: Zhu Yuelin, Stephens Robert M, Meltzer Paul S, Davis Sean R
Format: Article
Language: English
Published: BMC 2013-01-01
Series: BMC Bioinformatics
Online Access: http://www.biomedcentral.com/1471-2105/14/19
_version_ 1811294553151373312
author Zhu Yuelin
Stephens Robert M
Meltzer Paul S
Davis Sean R
author_sort Zhu Yuelin
collection DOAJ
description Background: The Sequence Read Archive (SRA) is the largest public repository of sequencing data from next-generation sequencing platforms, including Illumina (Genome Analyzer, HiSeq, MiSeq, etc.), Roche 454 GS System, Applied Biosystems SOLiD System, Helicos Heliscope, PacBio RS, and others. Results: SRAdb is an attempt to make queries of the metadata associated with SRA submissions, studies, samples, experiments, and runs more robust and precise, and to make access to sequencing data in the SRA easier. We have parsed all the SRA metadata into a SQLite database that is routinely updated and can be easily distributed. The SRAdb R/Bioconductor package then uses this SQLite database for querying and accessing metadata. Full-text search functionality makes querying metadata flexible and powerful. FASTQ files associated with query results can be downloaded easily for local analysis. The package also includes an interface from R to a popular genome browser, the Integrative Genomics Viewer (IGV). Conclusions: The SRAdb Bioconductor package provides a convenient and integrated framework to query and access SRA metadata quickly and powerfully from within R.
first_indexed 2024-04-13T05:19:13Z
format Article
id doaj.art-66e34da473c84973ac0e320ba017882f
institution Directory of Open Access Journals
issn 1471-2105
language English
last_indexed 2024-04-13T05:19:13Z
publishDate 2013-01-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-66e34da473c84973ac0e320ba017882f 2022-12-22T03:00:48Z eng BMC BMC Bioinformatics 1471-2105 2013-01-01 14 1 19 10.1186/1471-2105-14-19 SRAdb: query and use public next-generation sequencing data from within R Zhu Yuelin Stephens Robert M Meltzer Paul S Davis Sean R Background: The Sequence Read Archive (SRA) is the largest public repository of sequencing data from next-generation sequencing platforms, including Illumina (Genome Analyzer, HiSeq, MiSeq, etc.), Roche 454 GS System, Applied Biosystems SOLiD System, Helicos Heliscope, PacBio RS, and others. Results: SRAdb is an attempt to make queries of the metadata associated with SRA submissions, studies, samples, experiments, and runs more robust and precise, and to make access to sequencing data in the SRA easier. We have parsed all the SRA metadata into a SQLite database that is routinely updated and can be easily distributed. The SRAdb R/Bioconductor package then uses this SQLite database for querying and accessing metadata. Full-text search functionality makes querying metadata flexible and powerful. FASTQ files associated with query results can be downloaded easily for local analysis. The package also includes an interface from R to a popular genome browser, the Integrative Genomics Viewer (IGV). Conclusions: The SRAdb Bioconductor package provides a convenient and integrated framework to query and access SRA metadata quickly and powerfully from within R. http://www.biomedcentral.com/1471-2105/14/19
title SRAdb: query and use public next-generation sequencing data from within R
title_sort sradb query and use public next generation sequencing data from within r
url http://www.biomedcentral.com/1471-2105/14/19