Using the Chou’s Pseudo Component to Predict the ncRNA Locations Based on the Improved K-Nearest Neighbor (iKNN) Classifier

Chengyan Wu, Guo-Liang Fan, Qianzhong Li, Ru Xing

doi:10.2174/1574893614666191003142406

Abstract

Background: Identifying non-coding RNA (ncRNA) at the organelle genome level is a challenging task. In our previous work, we built an ncRNA dataset with less than 80% sequence identity and proposed a method that combines the increment of diversity with a support vector machine. Objective: Based on the ncRNA_361 dataset, we propose a novel decision-making method, an improved KNN (iKNN) classifier. Methods: The physicochemical features of nucleotides, the degeneracy of genetic codons, and the topological secondary structure were selected to represent ncRNA characteristics for the iKNN algorithm. The incremental feature selection method was then used to optimize the feature set. Results: The iKNN results indicate that the mean-value decision-making method is distinctly superior both to the traditional majority-vote decision-making method and to the Increment of Diversity Combining Support Vector Machine (ID-SVM). The iKNN algorithm achieved an overall accuracy of 97.368% in the jackknife test when k=3. Conclusion: The triplets of the structure-sequence mode under reading frames not only contain the entire sequence information but also reflect whether each base is paired, and the secondary structural topological parameters further describe the ncRNA secondary structure at the spatial level. The ncRNA dataset and the iKNN classifier are freely available at http://202.207.14.87:8032/fuwu/iKNN/index.asp.
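
The abstract contrasts a mean-value decision rule with the traditional majority vote and evaluates both with a jackknife (leave-one-out) test, but it does not spell out the mean-value rule. The sketch below is one plausible reading, not the authors' implementation: it assumes that, for each class, the distances from the query to its k nearest members of that class are averaged and the class with the smallest mean wins. The function names, the Euclidean distance, and the toy data are all illustrative assumptions.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn_majority(train, query, k=3):
    """Traditional KNN: majority vote among the k nearest training samples."""
    nearest = sorted(train, key=lambda item: euclidean(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def knn_mean(train, query, k=3):
    """Assumed mean-value rule: for each class, average the distances to
    its k nearest members and return the class with the smallest mean."""
    distances_by_class = {}
    for features, label in train:
        distances_by_class.setdefault(label, []).append(euclidean(features, query))
    mean_distance = {}
    for label, dists in distances_by_class.items():
        closest = sorted(dists)[:k]
        mean_distance[label] = sum(closest) / len(closest)
    return min(mean_distance, key=mean_distance.get)


def jackknife_accuracy(data, classify, k=3):
    """Leave-one-out test: each sample is predicted from all the others."""
    correct = 0
    for i, (features, label) in enumerate(data):
        rest = data[:i] + data[i + 1:]
        if classify(rest, features, k) == label:
            correct += 1
    return correct / len(data)


# Hypothetical two-class toy data, standing in for real ncRNA feature vectors.
TOY = [
    ((0.0, 0.0), "A"), ((0.1, 0.2), "A"), ((0.2, 0.1), "A"), ((0.15, 0.15), "A"),
    ((1.0, 1.0), "B"), ((1.1, 0.9), "B"), ((0.9, 1.1), "B"), ((1.05, 1.05), "B"),
]

if __name__ == "__main__":
    print("majority vote:", jackknife_accuracy(TOY, knn_majority, k=3))
    print("mean value:   ", jackknife_accuracy(TOY, knn_mean, k=3))
```

On well-separated data such as this toy set the two rules agree. They can differ when classes are imbalanced or overlap, because the mean-value rule compares each class on its own k nearest members instead of letting the most numerous class dominate a single neighbourhood.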


Journal: Current Bioinformatics	Publication Date: Nov 11, 2020
Citations: 6
