Benchmarking spliced alignment programs including Spaln2, an extended version of Spaln that incorporates additional species-specific features

Hiroaki Iwata, Osamu Gotoh

doi:10.1093/nar/gks708

Abstract

Spliced alignment plays a central role in the precise identification of eukaryotic gene structures. Even though many spliced alignment programs have been developed, recent rapid progress in DNA sequencing technologies demands further improvements in software tools. Benchmarking algorithms under various conditions is an indispensable task for the development of better software; however, there is a dire lack of appropriate datasets usable for benchmarking spliced alignment programs. In this study, we have constructed two types of datasets: simulated sequence datasets and actual cross-species datasets. The datasets are designed to correspond to various real situations, i.e. divergent eukaryotic species, different types of reference sequences, and the wide divergence between query and target sequences. In addition, we have developed an extended version of our program Spaln, which incorporates two additional features to the scoring scheme of the original version, and examined this extended version, Spaln2, together with the original Spaln and other representative aligners based on our benchmark datasets. Although the effects of the modifications are not individually striking, Spaln2 is consistently most accurate and reasonably fast in most practical cases, especially for plants and fungi and for increasingly divergent pairs of target and query sequences.

Journal: Nucleic Acids Research
Publication Date: Jul 30, 2012
Citations: 195
License type: CC BY-NC 3.0
