Classification of various genomic sequences based on distribution of repeated k-word.

Yong-Joon Song, Dong-Ho Cho

doi:10.1109/embc.2017.8037707

Journal: Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference
Publication Date: Jul 1, 2017
Citations: 9

Affiliation: Korea Advanced Institute of Science and Technology

#Alignment-based Methods #Alignment-free Methods

Abstract

Both alignment-free and alignment-based methods are used to extract phylogenetic information from DNA sequences. Alignment-based methods have high computational complexity, while conventional alignment-free methods have low accuracy. This paper proposes a new alignment-free method based on a distribution-of-repeated-k-word measure. The measure is built from k-words and their multiple repeated words. The proposed scheme achieves higher performance than conventional word-count methods while maintaining the same total time complexity. It also performs better than conventional alignment-free methods with respect to Robinson-Foulds (RF) distance.
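For context, the conventional word-count baseline that the abstract compares against can be sketched as follows. This is a minimal illustration, not the authors' method: the abstract does not define the repeated k-word measure, so it is not reproduced here. The function names, the `ACGT` alphabet, and the choice of Euclidean distance are all assumptions made for this example.

```python
from collections import Counter
from itertools import product
from math import sqrt


def kword_counts(seq, k):
    """Count every overlapping k-word (k-mer) in a sequence."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def kword_frequencies(seq, k, alphabet="ACGT"):
    """Return relative k-word frequencies in a fixed lexicographic word order.

    Assumption: words containing symbols outside `alphabet` are ignored.
    """
    counts = kword_counts(seq, k)
    words = ["".join(p) for p in product(alphabet, repeat=k)]
    total = sum(counts[w] for w in words)
    return [counts[w] / total if total else 0.0 for w in words]


def euclidean_distance(seq_a, seq_b, k):
    """Euclidean distance between two k-word frequency vectors.

    Assumption: Euclidean distance is just one common choice of
    dissimilarity for word-count methods.
    """
    fa = kword_frequencies(seq_a, k)
    fb = kword_frequencies(seq_b, k)
    return sqrt(sum((x - y) ** 2 for x, y in zip(fa, fb)))
```

Pairwise distances computed this way for a set of sequences form a distance matrix, from which a tree can be built and compared with a reference tree using RF distance.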

Full Text