The prediction of protein-protein interaction sites (PPIS) is currently crucial for regulating many biological activities in cells and developing drugs for various diseases. Deep learning-based methods have been proposed for predicting PPIS, significantly reducing the manpower and time costs associated with traditional experimental methods such as yeast two-hybrid, mass spectrometry, and affinity purification. However, the predictive accuracy of these deep learning methods has not yet reached the expected level. Therefore, we introduce a model called GACT-PPIS.The design of the GACT-PPIS algorithm aims to utilize combined information from protein sequences and structures as input to predict protein-protein interaction sites. The core of GACT-PPIS utilizes an Enhanced Graph Attention Network (EGAT) with initial residual and identity mappings, along with a deep Transformer network as the basic units, supplemented by Graph Convolutional Networks (GCN), effectively aggregating information from neighboring nodes for each node. After multiple network layers, the information of the entire protein is also fused into the nodes, and the Transformer network further enhances the model's performance.Experimental results show that GACT-PPIS outperforms the most representative models in terms of Recall, F1-measure, MCC, AUROC, and AUPRC on the benchmark test set (Test-60). Additionally, on other independent test sets (UBTest-31-6), GACT-PPIS leads in terms of Accuracy, Precision, Recall, F1-measure, MCC, AUROC, and AUPRC compared to the most representative models. It is worth noting that GACT-PPIS demonstrates excellent generalization and versatility across different test sets, showcasing good performance on multiple test sets for the same trained GACT-PPIS model.
Read full abstract