Iterative feature representations improve N4-methylcytosine site prediction.

Leyi Wei,Shasha Luan,Quan Zou,Balachandran Manavalan,Zhijun Liao,Xiaolong Shi,Ran Su

doi:10.1093/bioinformatics/btz408

Abstract

Accurate identification of N4-methylcytosine (4mC) modifications in a genome wide can provide insights into their biological functions and mechanisms. Machine learning recently have become effective approaches for computational identification of 4mC sites in genome. Unfortunately, existing methods cannot achieve satisfactory performance, owing to the lack of effective DNA feature representations that are capable to capture the characteristics of 4mC modifications. In this work, we developed a new predictor named 4mcPred-IFL, aiming to identify 4mC sites. To represent and capture discriminative features, we proposed an iterative feature representation algorithm that enables to learn informative features from several sequential models in a supervised iterative mode. Our analysis results showed that the feature representations learnt by our algorithm can capture the discriminative distribution characteristics between 4mC sites and non-4mC sites, enlarging the decision margin between the positives and negatives in feature space. Additionally, by evaluating and comparing our predictor with the state-of-the-art predictors on benchmark datasets, we demonstrate that our predictor can identify 4mC sites more accurately. The user-friendly webserver that implements the proposed 4mcPred-IFL is well established, and is freely accessible at http://server.malab.cn/4mcPred-IFL. Supplementary data are available at Bioinformatics online.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Iterative feature representations improve N4-methylcytosine site prediction.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics

Lead the way for us

Journal: Bioinformatics	Publication Date: May 17, 2019
Citations: 112

Similar Papers

Exploring sequence-based features for the improved prediction of DNA N4-methylcytosine sites in multiple species.
Leyi Wei ... Luis Augusto Eijy Nagai
Bioinformatics | VOL. 35
Leyi Wei, et. al.Leyi Wei ... Luis Augusto Eijy Nagai
19 Sep 2018
Bioinformatics | VOL. 35

PSP-PJMI: An innovative feature representation algorithm for identifying DNA N4-methylcytosine sites
Mingzhao Wang ... Shengquan Xu
Information Sciences | VOL. 606
Mingzhao Wang, et. al.Mingzhao Wang ... Shengquan Xu
20 May 2022
Information Sciences | VOL. 606

IDNA-MS: An Integrated Computational Tool for Detecting DNA Modification Sites in Multiple Genomes
Hao Lv ... Meng-Lu Liu
SSRN Electronic Journal | VOL. -
Hao Lv, et. al.Hao Lv ... Meng-Lu Liu
01 Jan 2020
SSRN Electronic Journal | VOL. -

I4mC-Mouse: Improved identification of DNA N4-methylcytosine sites in the mouse genome using multiple encoding schemes
Md Mehedi Hasan ... Hiroyuki Kurata
Computational and Structural Biotechnology Journal | VOL. 18
Md Mehedi Hasan, et. al.Md Mehedi Hasan ... Hiroyuki Kurata
01 Jan 2020
Computational and Structural Biotechnology Journal | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Iterative feature representations improve N4-methylcytosine site prediction.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics