EmPDBA: protein-DNA binding affinity prediction by combining features from binding partners and interface learned with ensemble regression model.

Shuang Yang, Xiaohan Sun, Tong Zhou, Lei Chen, Wenxue Zhou, Weikang Gong, Chunhua Li

doi:10.1093/bib/bbad192

Abstract

Protein-deoxyribonucleic acid (DNA) interactions are important in a variety of biological processes. Accurately predicting protein-DNA binding affinity is one of the most attractive and challenging problems in computational biology, and existing approaches still leave much room for improvement. In this work, we propose an ensemble model for Protein-DNA Binding Affinity prediction (emPDBA), which combines six base models with one meta-model. The complexes are classified into four types based on the DNA structure (double-stranded or other forms) and the percentage of interface residues. For each type, emPDBA is trained with sequence-based, structure-based and energy features from the binding partners and the complex structures. Feature selection by the sequential forward selection method reveals considerable differences across the complex types in the key factors contributing to intermolecular binding affinity. Classifying the complexes thus helps extract the features that matter for binding affinity prediction. On the independent testing dataset, emPDBA outperforms the state-of-the-art methods, with a Pearson correlation coefficient of 0.53 and a mean absolute error of 1.11 kcal/mol. The comprehensive results demonstrate that our method performs well for protein-DNA binding affinity prediction.

Availability and implementation: The source code is available at https://github.com/ChunhuaLiLab/emPDBA/.
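The abstract names two evaluation metrics (Pearson correlation coefficient and mean absolute error) and a greedy feature selection scheme (sequential forward selection). The following is a minimal, standard-library-only sketch of those three ideas, not the authors' implementation. All function names, the stopping rule (stop when no candidate improves the score) and the toy scoring function are assumptions made for illustration. In practice the score would typically be a cross-validated measure of a regression model trained on the candidate feature subset.

```python
import math
from typing import Callable, List, Optional, Sequence


def pearson_r(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Pearson correlation coefficient between measured and predicted values."""
    n = len(y_true)
    if n != len(y_pred) or n < 2:
        raise ValueError("inputs must have equal length of at least 2")
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    var_t = sum((t - mean_t) ** 2 for t in y_true)
    var_p = sum((p - mean_p) ** 2 for p in y_pred)
    if var_t == 0 or var_p == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_t * var_p)


def mean_absolute_error(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Mean absolute error, in the same unit as the inputs (e.g. kcal/mol)."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def sequential_forward_selection(
    candidates: Sequence[str],
    score: Callable[[List[str]], float],
    max_features: Optional[int] = None,
) -> List[str]:
    """Greedily add the feature that most improves `score` (higher is better).

    Assumed stopping rule: stop when no remaining feature improves the
    score, or when `max_features` features have been selected.
    """
    selected: List[str] = []
    remaining = list(candidates)
    best_score = float("-inf")
    limit = len(remaining) if max_features is None else max_features
    while remaining and len(selected) < limit:
        # Score every one-feature extension of the current subset.
        trials = [(score(selected + [f]), f) for f in remaining]
        top_score, top_feature = max(trials, key=lambda pair: pair[0])
        if top_score <= best_score:
            break  # no candidate improves on the current subset
        selected.append(top_feature)
        remaining.remove(top_feature)
        best_score = top_score
    return selected


# Toy usage: a hypothetical additive score in which "c" hurts performance.
toy_weights = {"a": 3.0, "b": 2.0, "c": -1.0}
chosen = sequential_forward_selection(
    ["a", "b", "c"], lambda subset: sum(toy_weights[f] for f in subset)
)
```

With the toy score above, the selection keeps "a" and "b" and stops before adding "c", because adding it would lower the score.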

Journal: Briefings in Bioinformatics
Publication Date: May 16, 2023
Citations: 4

Similar Papers

Improving classification accuracy for separation of area under crops based on feature selection from multi-temporal images and machine learning algorithms
Mostafa Kabolizadeh ... Khalil Habashi
Advances in Space Research | VOL. 72
22 Sep 2023

PreDBA: A heterogeneous ensemble approach for predicting protein-DNA binding affinity
Wenyi Yang ... Lei Deng
Scientific Reports | VOL. 10
28 Jan 2020

Feature selection for automatic classification of musical instrument sounds
Mingchun Liu ... Chunru Wan
01 Jan 2001

A Feature and Algorithm Selection Method for Improving the Prediction of Protein Structural Class.
Qianwu Ni ... Lei Chen
Combinatorial Chemistry & High Throughput Screening | VOL. 20
23 Oct 2017
