K2: Efficient alignment-free sequence similarity measurement using the Kendall statistic

Jie Lin, Donald A Adjeroh, Bing-Hua Jiang, Yue Jiang

doi:10.1109/bibm.2016.7822679

Publication Date: Dec 1, 2016

Citations: 2

Affiliation: Fujian Normal University, Software (Spain), West Virginia University, Thomas Jefferson University

Abstract

Alignment-free sequence comparison methods can compute the similarity between a large number of sequences much faster than methods that depend on sequence alignment. We propose a new alignment-free sequence comparison method, called K2, based on the non-parametric Kendall statistic. Compared with state-of-the-art alignment-free comparison methods (e.g., D2, D2*, D2sh, and the chi-square (χ²) statistic), K2 showed comparable power, with similar or better performance in computing the edit distance (similarity/dissimilarity) among a huge number of sequences. The K2 approach was also much faster than each of the other methods, especially for long sequences.
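
The abstract does not give the definition of the statistic, so the following is only a minimal sketch of the general idea, under stated assumptions: each sequence is summarized by its k-mer count vector, and two sequences are compared with the Kendall rank correlation (tau-b) of those vectors. The function names `kmer_counts` and `kendall_tau_b`, the choice of k, and the DNA alphabet are all hypothetical, and the naive quadratic loop is not the efficient algorithm the paper refers to.

```python
from collections import Counter
from itertools import product
from math import sqrt


def kmer_counts(seq, k, alphabet="ACGT"):
    """Count every possible k-mer over `alphabet`, in a fixed order.

    Windows containing characters outside `alphabet` (e.g. 'N') are ignored.
    """
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    return [counts["".join(p)] for p in product(alphabet, repeat=k)]


def kendall_tau_b(x, y):
    """Naive O(n^2) Kendall tau-b between two equal-length vectors."""
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    concordant = discordant = only_x_tied = only_y_tied = 0
    n = len(x)
    for i in range(n):
        for j in range(i + 1, n):
            dx = x[i] - x[j]
            dy = y[i] - y[j]
            if dx == 0 and dy == 0:
                continue  # tied in both vectors: excluded everywhere
            elif dx == 0:
                only_x_tied += 1
            elif dy == 0:
                only_y_tied += 1
            elif dx * dy > 0:
                concordant += 1
            else:
                discordant += 1
    # Pairs untied in x, and pairs untied in y, form the tau-b denominator.
    untied_x = concordant + discordant + only_y_tied
    untied_y = concordant + discordant + only_x_tied
    denom = sqrt(untied_x * untied_y)
    return (concordant - discordant) / denom if denom else 0.0


# Hypothetical usage: compare two short DNA strings with k = 2.
a = "ACGTACGTACGGTACC"
b = "ACGTTCGTACGGAACC"
similarity = kendall_tau_b(kmer_counts(a, 2), kmer_counts(b, 2))
```

Values near 1 indicate that the two sequences rank their k-mers in nearly the same order of abundance; values near 0 indicate little agreement. Tau-b is used in this sketch because k-mer count vectors usually contain many ties (for example, many zero counts).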
