ProTRAC - a software for probabilistic piRNA cluster detection, visualization and analysis

David Rosenkranz,Hans Zischler

doi:10.1186/1471-2105-13-5

Abstract

BackgroundThroughout the metazoan lineage, typically gonadal expressed Piwi proteins and their guiding piRNAs (~26-32nt in length) form a protective mechanism of RNA interference directed against the propagation of transposable elements (TEs). Most piRNAs are generated from genomic piRNA clusters. Annotation of experimentally obtained piRNAs from small RNA/cDNA-libraries and detection of genomic piRNA clusters are crucial for a thorough understanding of the still enigmatic piRNA pathway, especially in an evolutionary context. Currently, detection of piRNA clusters relies on bioinformatics rather than detection and sequencing of primary piRNA cluster transcripts and the stringency of the methods applied in different studies differs considerably. Additionally, not all important piRNA cluster characteristics were taken into account during bioinformatic processing. Depending on the applied method this can lead to: i) an accidentally underrepresentation of TE related piRNAs, ii) overlook duplicated clusters harboring few or no single-copy loci and iii) false positive annotation of clusters that are in fact just accumulations of multi-copy loci corresponding to frequently mapped reads, but are not transcribed to piRNA precursors.ResultsWe developed a software which detects and analyses piRNA clusters (proTRAC, probabilistic TRacking and Analysis of Clusters) based on quantifiable deviations from a hypothetical uniform distribution regarding the decisive piRNA cluster characteristics. We used piRNA sequences from human, macaque, mouse and rat to identify piRNA clusters in the respective species with proTRAC and compared the obtained results with piRNA cluster annotation from piRNABank and the results generated by different hitherto applied methods.proTRAC identified clusters not annotated at piRNABank and rejected annotated clusters based on the absence of important features like strand asymmetry. We further show, that proTRAC detects clusters that are passed over if a minimum number of single-copy piRNA loci are required and that proTRAC assigns more sequence reads per cluster since it does not preclude frequently mapped reads from the analysis.ConclusionsWith proTRAC we provide a reliable tool for detection, visualization and analysis of piRNA clusters. Detected clusters are well supported by comprehensible probabilistic parameters and retain a maximum amount of information, thus overcoming the present conflict of sensitivity and specificity in piRNA cluster detection.

Highlights

Throughout the metazoan lineage, typically gonadal expressed Piwi proteins and their guiding PIWI-interacting RNAs (piRNAs) (~26-32nt in length) form a protective mechanism of RNA interference directed against the propagation of transposable elements (TEs)
That real piRNA clusters are concealed by the presence of loci that correspond to frequently mapped sequence reads that do not originate from the cluster in question but distort its strand asymmetry, proTRAC optionally reevaluates rejected cluster candidates in that case, considering only loci that correspond to sequence reads that mapped not more than a stated maximum times to the genome, similar to the method applied by Girard et al 2006 [4]
Nonredundant human (32046), mouse (34520) and rat (31357) piRNA sequences downloaded from National Center for Biotechnology Information (NCBI) nucleotide database [9] were mapped to the respective genomes via SeqMap and the SeqMap output files were used as input files for proTRAC

Summary

Results

We developed a software which detects and analyses piRNA clusters (proTRAC, probabilistic TRacking and Analysis of Clusters) based on quantifiable deviations from a hypothetical uniform distribution regarding the decisive piRNA cluster characteristics. We used piRNA sequences from human, macaque, mouse and rat to identify piRNA clusters in the respective species with proTRAC and compared the obtained results with piRNA cluster annotation from piRNABank and the results generated by different hitherto applied methods. ProTRAC identified clusters not annotated at piRNABank and rejected annotated clusters based on the absence of important features like strand asymmetry. That proTRAC detects clusters that are passed over if a minimum number of single-copy piRNA loci are required and that proTRAC assigns more sequence reads per cluster since it does not preclude frequently mapped reads from the analysis

Conclusions

Background

Results and Discussion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Jan 10, 2012
Citations: 173	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

ProTRAC - a software for probabilistic piRNA cluster detection, visualization and analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Systematic and single cell analysis of Xenopus Piwi-interacting RNAs and Xiwi
Nelson C Lau ... Mark Borowsky
The EMBO Journal | VOL. 28
Nelson C Lau, et. al.Nelson C Lau ... Mark Borowsky
27 Aug 2009
The EMBO Journal | VOL. 28

Author response: The Drosophila ZAD zinc finger protein Kipferl guides Rhino to piRNA clusters
Lisa Baumgartner ... Dominik Handler
-
Lisa Baumgartner, et. al.Lisa Baumgartner ... Dominik Handler
29 Jul 2022
29 Jul 2022

PiRNA clusters and open chromatin structure.
Soichiro Yamanaka ... Haruhiko Siomi
Mobile DNA | VOL. 5
Soichiro Yamanaka, et. al.Soichiro Yamanaka ... Haruhiko Siomi
01 Aug 2014
Mobile DNA | VOL. 5

Specialized piRNA Pathways Act in Germline and Somatic Tissues of the Drosophila Ovary
Colin D Malone ... Gregory J Hannon
Cell | VOL. 137
Colin D Malone, et. al.Colin D Malone ... Gregory J Hannon
23 Apr 2009
Cell | VOL. 137

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

ProTRAC - a software for probabilistic piRNA cluster detection, visualization and analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics