Abstract

Comparison of the binding sites of proteins is an effective means for predicting protein functions based on their structure information. Despite the importance of this problem and much research in the past, it is still very challenging to predict the binding ligands from the atomic structures of protein binding sites. Here, we designed a new algorithm, TIPSA (Triangulation-based Iterative-closest-point for Protein Surface Alignment), based on the iterative closest point (ICP) algorithm. TIPSA aims to find the maximum number of atoms that can be superposed between two protein binding sites, where any pair of superposed atoms has a distance smaller than a given threshold. The search starts from similar tetrahedra between two binding sites obtained from 3D Delaunay triangulation and uses the Hungarian algorithm to find additional matched atoms. We found that, due to the plasticity of protein binding sites, matching the rigid body of point clouds of protein binding sites is not adequate for satisfactory binding ligand prediction. We further incorporated global geometric information, the radius of gyration of binding site atoms, and used nearest neighbor classification for binding site prediction. Tested on benchmark data, our method achieved a performance comparable to the best methods in the literature, while simultaneously providing the common atom set and atom correspondences.

Highlights

  • The functions of individual proteins and genes are essential for understanding the functions of cells or organisms as a whole

  • We have developed a method based on the iterative closest point (ICP) algorithm [45,46] for superposing and comparing protein ligand binding sites using atom-level representation of protein surfaces

  • In addition to matched atoms, we incorporate other geometric information to further improve the accuracy in ligand classification

Read more

Summary

Introduction

The functions of individual proteins and genes are essential for understanding the functions of cells or organisms as a whole. We have developed a method based on the iterative closest point (ICP) algorithm [45,46] for superposing and comparing protein ligand binding sites using atom-level representation of protein surfaces. Compared to the original ICP algorithm, our algorithm starts from a multitude of initial local alignments derived from 3D Delaunay triangulations and uses the Hungarian algorithm to find additional matched atoms This Triangulation-based Iterative-closest-point for Protein Surface Alignment (TIPSA) algorithm aims to find the maximum common atom set (MCAS), defined as the maximum number of superposable atoms between two binding sites where distance between any pair of matched atoms in the rigid superposition of the binding sites is smaller than a given threshold value. This paper builds upon a preliminary study [47] which was based on local geometric information only, and features more thorough analysis including additional, more detailed results, and improvements to the algorithm that reduce computational cost while improving classification performance

Methods
Results
Method
Discussion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call