Pairwise DNA Alignment with Sequence Specific Transition-Transversion Ratio Using Multiple Parameter Sets

Ankit Agrawal, Xiaoqiu Huang

doi:10.1109/icit.2008.62

Abstract

Pairwise alignment of DNA and protein sequences is a fundamental task in bioinformatics that underlies many other bioinformatics applications. Protein sequence alignment generally receives more attention than DNA sequence alignment, and protein alignment methods can usually be applied to DNA sequences with little modification. However, alignment methods designed specifically for DNA that exploit sequence-specific information are highly desirable. Most existing DNA alignment programs use a simple match/mismatch scoring scheme. Recently, an iterative alignment scheme using a sequence-specific transition-transversion ratio was shown to outperform a simple match/mismatch scoring scheme. In this paper, we present a modification of the iterative approach that incorporates multiple parameter sets. Preliminary experiments indicate that using multiple parameter sets gives significantly better performance than using either a single parameter set or a simple match/mismatch scoring scheme. Sequence-specific scoring matrices have been highly successful for protein alignment over the last decade, and the current work should be a significant step toward using sequence-specific substitution matrices for DNA sequences.
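To make the distinction between a simple match/mismatch scheme and a transition-transversion-aware scheme concrete, the following is a minimal Python sketch, not the authors' implementation. It classifies aligned base pairs as matches, transitions (purine to purine or pyrimidine to pyrimidine), or transversions, counts them over an existing alignment, and scores a pair accordingly. The function names and the numeric scores are illustrative assumptions, and the sketch assumes sequences over A, C, G, T only.

```python
# Illustrative sketch only: names and score values are assumptions,
# not taken from the paper.
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def substitution_type(a, b):
    """Classify an aligned base pair as 'match', 'transition', or 'transversion'."""
    a, b = a.upper(), b.upper()
    if a == b:
        return "match"
    # A transition stays within one chemical class (A<->G or C<->T).
    same_class = ({a, b} <= PURINES) or ({a, b} <= PYRIMIDINES)
    return "transition" if same_class else "transversion"


def ts_tv_counts(aligned_a, aligned_b, gap="-"):
    """Count transitions and transversions over the gap-free columns of an alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    counts = {"match": 0, "transition": 0, "transversion": 0}
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # columns containing a gap carry no substitution information
        counts[substitution_type(a, b)] += 1
    return counts["transition"], counts["transversion"]


def score_pair(a, b, match=1.0, transition=-0.5, transversion=-1.0):
    """Score one aligned pair, penalizing transversions more than transitions.

    The default values are hypothetical placeholders.
    """
    scores = {"match": match, "transition": transition, "transversion": transversion}
    return scores[substitution_type(a, b)]


if __name__ == "__main__":
    ts, tv = ts_tv_counts("ACGT-A", "GCTTAA")
    print("transitions:", ts, "transversions:", tv)
    print("A/G score:", score_pair("A", "G"))
    print("A/C score:", score_pair("A", "C"))
```

In an iterative setting, the counts from one alignment could be used to re-estimate the ratio and adjust the transition and transversion scores for the next pass; how that update is performed here is left open, since the abstract does not specify it.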

Full Text