proTRAC - a software for probabilistic piRNA cluster detection, visualization and analysis

Abstract. Background: Throughout the metazoan lineage, typically gonadal expressed Piwi proteins and their guiding piRNAs (~26-32nt in length) form a protective mechanism of RNA interference directed against the propagation of transposable elements (TEs)....

Bibliographic Details
Main Authors: Rosenkranz David, Zischler Hans
Format: Article
Language: English
Published: BMC 2012-01-01
Series: BMC Bioinformatics
Online Access: http://www.biomedcentral.com/1471-2105/13/5
collection DOAJ
description Abstract

Background: Throughout the metazoan lineage, Piwi proteins, which are typically expressed in the gonads, and their guiding piRNAs (~26-32 nt in length) form a protective mechanism of RNA interference directed against the propagation of transposable elements (TEs). Most piRNAs are generated from genomic piRNA clusters. Annotation of experimentally obtained piRNAs from small RNA/cDNA libraries and detection of genomic piRNA clusters are crucial for a thorough understanding of the still enigmatic piRNA pathway, especially in an evolutionary context. Currently, detection of piRNA clusters relies on bioinformatics rather than on detection and sequencing of primary piRNA cluster transcripts, and the stringency of the methods applied in different studies differs considerably. Additionally, not all important piRNA cluster characteristics were taken into account during bioinformatic processing. Depending on the applied method, this can lead to: i) an accidental underrepresentation of TE-related piRNAs, ii) overlooked duplicated clusters harboring few or no single-copy loci, and iii) false-positive annotation of clusters that are in fact just accumulations of multi-copy loci corresponding to frequently mapped reads but are not transcribed to piRNA precursors.

Results: We developed software that detects and analyses piRNA clusters (proTRAC, probabilistic TRacking and Analysis of Clusters) based on quantifiable deviations from a hypothetical uniform distribution regarding the decisive piRNA cluster characteristics. We used piRNA sequences from human, macaque, mouse and rat to identify piRNA clusters in the respective species with proTRAC and compared the obtained results with the piRNA cluster annotation from piRNABank and with the results generated by different previously applied methods. proTRAC identified clusters not annotated at piRNABank and rejected annotated clusters based on the absence of important features such as strand asymmetry. We further show that proTRAC detects clusters that are passed over if a minimum number of single-copy piRNA loci is required, and that proTRAC assigns more sequence reads per cluster since it does not exclude frequently mapped reads from the analysis.

Conclusions: With proTRAC we provide a reliable tool for detection, visualization and analysis of piRNA clusters. Detected clusters are well supported by comprehensible probabilistic parameters and retain a maximum amount of information, thus overcoming the present conflict between sensitivity and specificity in piRNA cluster detection.
first_indexed 2024-04-13T01:41:23Z
id doaj.art-e072ad1f7bfb4857b9a08f018ecbcc13
institution Directory Open Access Journal
issn 1471-2105
last_indexed 2024-04-13T01:41:23Z
spelling doaj.art-e072ad1f7bfb4857b9a08f018ecbcc13 2022-12-22T03:08:11Z eng BMC BMC Bioinformatics 1471-2105 2012-01-01 13 1 5 10.1186/1471-2105-13-5 proTRAC - a software for probabilistic piRNA cluster detection, visualization and analysis Rosenkranz David Zischler Hans http://www.biomedcentral.com/1471-2105/13/5