Abstract
BackgroundThe analysis of differential splicing (DS) is crucial for understanding physiological processes in cells and organs. In particular, aberrant transcripts are known to be involved in various diseases including cancer. A widely used technique for studying DS are exon arrays. Over the last decade a variety of algorithms for the detection of DS events from exon arrays has been developed. However, no comprehensive, comparative evaluation including sensitivity to the most important data features has been conducted so far. To this end, we created multiple data sets based on simulated data to assess strengths and weaknesses of seven published methods as well as a newly developed method, KLAS. Additionally, we evaluated all methods on two cancer data sets that comprised RT-PCR validated results.ResultsOur studies indicated ARH as the most robust methods when integrating the results over all scenarios and data sets. Nevertheless, special cases or requirements favor other methods. While FIRMA was highly sensitive according to experimental data, SplicingCompass, MIDAS and ANOSVA showed high specificity throughout the scenarios. On experimental data ARH, FIRMA, MIDAS, and KLAS performed best.ConclusionsEach method shows different characteristics regarding sensitivity, specificity, interference to certain data settings and robustness over multiple data sets. While some methods can be considered as generally good choices over all data sets and scenarios, other methods show heterogeneous prediction quality on the different data sets. The adequate method has to be chosen carefully and with a defined study aim in mind.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-015-1322-x) contains supplementary material, which is available to authorized users.
Highlights
The analysis of differential splicing (DS) is crucial for understanding physiological processes in cells and organs
Analysis of variance was applied to determine the significance of parameter influence
The results on the colon and lung cancer data sets were reported with a focus on the RTPCR validated results
Summary
The analysis of differential splicing (DS) is crucial for understanding physiological processes in cells and organs. No comprehensive, comparative evaluation including sensitivity to the most important data features has been conducted so far. To this end, we created multiple data sets based on simulated data to assess strengths and weaknesses of seven published methods as well as a newly developed method, KLAS. One possibility to capture such alterations is provided by exon arrays In comparison to their more coarse-grained predecessors, the gene arrays, they offer an exon-based resolution [4]. This possibility led to wide-spread usage, reflected by over 15,000 samples across many different tissues deposited into GEO [5]
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.