Abstract
Protein-deoxyribonucleic acid (DNA) interactions are important in a variety of biological processes. Accurately predicting protein-DNA binding affinity has been one of the most attractive and challenging issues in computational biology. However, the existing approaches still have much room for improvement. In this work, we propose an ensemble model for Protein-DNA Binding Affinity prediction (emPDBA), which combines six base models with one meta-model. The complexes are classified into four types based on the DNA structure (double-stranded or other forms) and the percentage of interface residues. For each type, emPDBA is trained with the sequence-based, structure-based and energy features from binding partners and complex structures. Through feature selection by the sequential forward selection method, it is found that there do exist considerable differences in the key factors contributing to intermolecular binding affinity. The complex classification is beneficial for the important feature extraction for binding affinity prediction. The performance comparison of our method with other peer ones on the independent testing dataset shows that emPDBA outperforms the state-of-the-art methods with the Pearson correlation coefficient of 0.53 and the mean absolute error of 1.11kcal/mol. The comprehensive results demonstrate that our method has a good performance for protein-DNA binding affinity prediction. Availability and implementation: The source code is available at https://github.com/ChunhuaLiLab/emPDBA/.
Full Text
Topics from this Paper
Sequential Forward Selection Method
protein-DNA Binding Affinity
Ensemble Regression Model
Binding Partners
Protein-deoxyribonucleic Acid
+ Show 5 more
Create a personalized feed of these topics
Get StartedSimilar Papers
iScience
Mar 1, 2020
Future Drug Discovery
May 5, 2023
Scientific Reports
Jan 28, 2020
Structure
Jun 1, 2018
Combinatorial Chemistry & High Throughput Screening
Oct 23, 2017
Jan 1, 2001
Molecular Therapy - Nucleic Acids
Jun 1, 2021
Thermal Engineering
Mar 1, 2019
Journal of chemical information and modeling
May 26, 2023
Ocean Engineering
Feb 1, 2023
Journal of Biological Chemistry
Oct 1, 2019
Proteins: Structure, Function, and Bioinformatics
Sep 14, 2013
Water, Air, & Soil Pollution
Jun 1, 2020
IEEE Access
Jan 1, 2021
Molecular Diversity
Aug 1, 2007
Briefings in bioinformatics
Briefings in bioinformatics
Sep 15, 2023
Briefings in bioinformatics
Sep 5, 2023
Briefings in bioinformatics
Sep 5, 2023
Briefings in bioinformatics
Sep 5, 2023
Briefings in bioinformatics
Sep 5, 2023
Briefings in bioinformatics
Sep 4, 2023
Briefings in bioinformatics
Sep 4, 2023
Briefings in bioinformatics
Aug 31, 2023
Briefings in bioinformatics
Aug 31, 2023
Briefings in bioinformatics
Aug 31, 2023