ADaCGH: A parallelized web-based application and R package for the analysis of aCGH data.

BACKGROUND: Copy number alterations (CNAs) in genomic DNA have been associated with complex human diseases, including cancer. One of the most common techniques to detect CNAs is array-based comparative genomic hybridization (aCGH). The availability of aCGH platforms and the need for identification o...

Full description

Bibliographic Details
Main Authors: Ramón Díaz-Uriarte, Oscar M Rueda
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2007-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC1940324?pdf=render
_version_ 1819262588106571776
author Ramón Díaz-Uriarte
Oscar M Rueda
author_facet Ramón Díaz-Uriarte
Oscar M Rueda
author_sort Ramón Díaz-Uriarte
collection DOAJ
description BACKGROUND: Copy number alterations (CNAs) in genomic DNA have been associated with complex human diseases, including cancer. One of the most common techniques to detect CNAs is array-based comparative genomic hybridization (aCGH). The availability of aCGH platforms and the need for identification of CNAs has resulted in a wealth of methodological studies. METHODOLOGY/PRINCIPAL FINDINGS: ADaCGH is an R package and a web-based application for the analysis of aCGH data. It implements eight methods for detection of CNAs, gains and losses of genomic DNA, including all of the best performing ones from two recent reviews (CBS, GLAD, CGHseg, HMM). For improved speed, we use parallel computing (via MPI). Additional information (GO terms, PubMed citations, KEGG and Reactome pathways) is available for individual genes, and for sets of genes with altered copy numbers. CONCLUSIONS/SIGNIFICANCE: ADACGH represents a qualitative increase in the standards of these types of applications: a) all of the best performing algorithms are included, not just one or two; b) we do not limit ourselves to providing a thin layer of CGI on top of existing BioConductor packages, but instead carefully use parallelization, examining different schemes, and are able to achieve significant decreases in user waiting time (factors up to 45x); c) we have added functionality not currently available in some methods, to adapt to recent recommendations (e.g., merging of segmentation results in wavelet-based and CGHseg algorithms); d) we incorporate redundancy, fault-tolerance and checkpointing, which are unique among web-based, parallelized applications; e) all of the code is available under open source licenses, allowing to build upon, copy, and adapt our code for other software projects.
first_indexed 2024-12-23T20:00:05Z
format Article
id doaj.art-33806b24c08041b6b834831722080e40
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-23T20:00:05Z
publishDate 2007-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-33806b24c08041b6b834831722080e402022-12-21T17:33:06ZengPublic Library of Science (PLoS)PLoS ONE1932-62032007-01-0128e73710.1371/journal.pone.0000737ADaCGH: A parallelized web-based application and R package for the analysis of aCGH data.Ramón Díaz-UriarteOscar M RuedaBACKGROUND: Copy number alterations (CNAs) in genomic DNA have been associated with complex human diseases, including cancer. One of the most common techniques to detect CNAs is array-based comparative genomic hybridization (aCGH). The availability of aCGH platforms and the need for identification of CNAs has resulted in a wealth of methodological studies. METHODOLOGY/PRINCIPAL FINDINGS: ADaCGH is an R package and a web-based application for the analysis of aCGH data. It implements eight methods for detection of CNAs, gains and losses of genomic DNA, including all of the best performing ones from two recent reviews (CBS, GLAD, CGHseg, HMM). For improved speed, we use parallel computing (via MPI). Additional information (GO terms, PubMed citations, KEGG and Reactome pathways) is available for individual genes, and for sets of genes with altered copy numbers. CONCLUSIONS/SIGNIFICANCE: ADACGH represents a qualitative increase in the standards of these types of applications: a) all of the best performing algorithms are included, not just one or two; b) we do not limit ourselves to providing a thin layer of CGI on top of existing BioConductor packages, but instead carefully use parallelization, examining different schemes, and are able to achieve significant decreases in user waiting time (factors up to 45x); c) we have added functionality not currently available in some methods, to adapt to recent recommendations (e.g., merging of segmentation results in wavelet-based and CGHseg algorithms); d) we incorporate redundancy, fault-tolerance and checkpointing, which are unique among web-based, parallelized applications; e) all of the code is available under open source licenses, allowing to build upon, copy, and adapt our code for other software projects.http://europepmc.org/articles/PMC1940324?pdf=render
spellingShingle Ramón Díaz-Uriarte
Oscar M Rueda
ADaCGH: A parallelized web-based application and R package for the analysis of aCGH data.
PLoS ONE
title ADaCGH: A parallelized web-based application and R package for the analysis of aCGH data.
title_full ADaCGH: A parallelized web-based application and R package for the analysis of aCGH data.
title_fullStr ADaCGH: A parallelized web-based application and R package for the analysis of aCGH data.
title_full_unstemmed ADaCGH: A parallelized web-based application and R package for the analysis of aCGH data.
title_short ADaCGH: A parallelized web-based application and R package for the analysis of aCGH data.
title_sort adacgh a parallelized web based application and r package for the analysis of acgh data
url http://europepmc.org/articles/PMC1940324?pdf=render
work_keys_str_mv AT ramondiazuriarte adacghaparallelizedwebbasedapplicationandrpackagefortheanalysisofacghdata
AT oscarmrueda adacghaparallelizedwebbasedapplicationandrpackagefortheanalysisofacghdata