The Ruby UCSC API: accessing the UCSC genome database using Ruby

<p>Abstract</p> <p>Background</p> <p>The University of California, Santa Cruz (UCSC) genome database is among the most used sources of genomic annotation in human and other organisms. The database offers an excellent web-based graphical user interface (the UCSC genome b...

Full description

Bibliographic Details
Main Authors: Mishima Hiroyuki, Aerts Jan, Katayama Toshiaki, Bonnal Raoul J P, Yoshiura Koh-ichiro
Format: Article
Language:English
Published: BMC 2012-09-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/13/240
Description
Summary:<p>Abstract</p> <p>Background</p> <p>The University of California, Santa Cruz (UCSC) genome database is among the most used sources of genomic annotation in human and other organisms. The database offers an excellent web-based graphical user interface (the UCSC genome browser) and several means for programmatic queries. A simple application programming interface (API) in a scripting language aimed at the biologist was however not yet available. Here, we present the Ruby UCSC API, a library to access the UCSC genome database using Ruby.</p> <p>Results</p> <p>The API is designed as a BioRuby plug-in and built on the ActiveRecord 3 framework for the object-relational mapping, making writing SQL statements unnecessary. The current version of the API supports databases of all organisms in the UCSC genome database including human, mammals, vertebrates, deuterostomes, insects, nematodes, and yeast.</p> <p>The API uses the bin index—if available—when querying for genomic intervals. The API also supports genomic sequence queries using locally downloaded *.2bit files that are not stored in the official MySQL database. The API is implemented in pure Ruby and is therefore available in different environments and with different Ruby interpreters (including JRuby).</p> <p>Conclusions</p> <p>Assisted by the straightforward object-oriented design of Ruby and ActiveRecord, the Ruby UCSC API will facilitate biologists to query the UCSC genome database programmatically. The API is available through the RubyGem system. Source code and documentation are available at <url>https://github.com/misshie/bioruby-ucsc-api/</url> under the Ruby license. Feedback and help is provided via the website at <url>http://rubyucscapi.userecho.com/</url>.</p>
ISSN:1471-2105