Phylogenetic analysis of DNA sequences based on k-word and rough set theory

Chun Li, Yan Yang, Meiduo Jia, Yingying Zhang, Xiaoqing Yu, Changzhong Wang

doi:10.1016/j.physa.2013.12.025

Abstract

Among alignment-free methods for sequence comparison, the k-word frequency model is well developed. However, most existing word-based methods neglect relationships among k-word frequencies, while a few others focus on the correlation of k-words but ignore the word frequency itself. In this paper, we propose a new k-word method that addresses both problems. By means of the characteristic sequences of a DNA sequence, we construct a 3×2^k-dimensional complete word-based vector. We then present a feature selection scheme based on rough set theory (RST) to extract the most informative k-words, and use only these selected features to represent the DNA sequence. To evaluate the effectiveness of our method, we test it by phylogenetic analysis on three datasets. The first is used as a training set, from which 869 top-ranked k-words are selected. The other two are used as testing sets. The results demonstrate that the proposed method can capture more important information and is more efficient for molecular phylogenetic analysis.
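The construction of the complete word-based vector can be illustrated with a minimal Python sketch. The abstract does not say how the characteristic sequences are defined, so the sketch assumes the three commonly used binary classifications of nucleotides (purine/pyrimidine, amino/keto, weak/strong hydrogen bond). Each classification turns the DNA sequence into a 0-1 sequence with 2^k possible k-words, giving 3×2^k frequencies in total. The function names and the normalization by the number of overlapping windows are illustrative choices, not taken from the paper, and the RST feature selection step is not shown.

```python
from itertools import product

# ASSUMPTION: the three binary classifications below are a common choice
# for characteristic sequences; the abstract does not define them.
CLASSES = {
    "purine": set("AG"),  # purine (1) vs pyrimidine (0)
    "amino": set("AC"),   # amino (1) vs keto (0)
    "weak": set("AT"),    # weak (1) vs strong (0) hydrogen bond
}


def characteristic_sequences(dna):
    """Map a DNA string to three 0-1 strings, one per classification.

    Any symbol outside the listed members (including ambiguity codes)
    is mapped to '0' in this sketch.
    """
    dna = dna.upper()
    return {
        name: "".join("1" if base in members else "0" for base in dna)
        for name, members in CLASSES.items()
    }


def kword_frequencies(binary_seq, k):
    """Relative frequencies of all 2^k binary k-words, in lexicographic order."""
    words = ["".join(p) for p in product("01", repeat=k)]
    counts = dict.fromkeys(words, 0)
    total = len(binary_seq) - k + 1  # number of overlapping windows
    if total <= 0:
        return [0.0] * len(words)
    for i in range(total):
        counts[binary_seq[i:i + k]] += 1
    return [counts[w] / total for w in words]


def word_vector(dna, k):
    """Concatenate the three frequency vectors into one of length 3 * 2^k."""
    vector = []
    for seq in characteristic_sequences(dna).values():
        vector.extend(kword_frequencies(seq, k))
    return vector


if __name__ == "__main__":
    v = word_vector("ACGTACGTAC", 3)
    print(len(v))  # 3 * 2**3 entries
```

A feature selection step would then keep only a subset of these 3×2^k coordinates before sequences are compared.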

Journal: Physica A: Statistical Mechanics and its Applications
Publication Date: Dec 22, 2013
Citations: 17
