Implementing link prediction in protein networks via feature fusion models based on graph neural networks

Chi Zhang,Qian Gao,Ming Li,Tianfei Yu

doi:10.1016/j.compbiolchem.2023.107980

Abstract

MotivationProtein-protein interactions serve as the cornerstone for various biochemical processes within biological organisms. Existing research methodologies predominantly employ link prediction techniques to analyze these interaction networks. However, traditional approaches often fall short in delivering satisfactory predictive performance when applied to multi-species datasets. Current computational methods largely focus on analyzing the network topology, resulting in a somewhat monolithic feature set. The integration of diverse features in the model could potentially yield superior performance and broader applicability. To this end, we propose an autoencoder model built on graph neural networks, designed to enhance both predictive performance and generalizability by leveraging the integration of gene ontology. ResultsIn this research, we developed AGraphSAGE, a model specifically designed for analyzing protein-protein interaction network data. By seamlessly integrating gene ontology into the graph structure, we employed a dual-channel graph sampling and aggregation network that capitalizes on topological information to process high-dimensional features. Feature fusion is achieved through the implementation of graph attention mechanisms, and we adopted a link prediction framework as the experimental training model. Performance was evaluated on real-world datasets using key metrics, such as Area Under the Curve (AUC). A hyperparameter search space was established, and a Bayesian optimization strategy was applied to iteratively fine-tune the model, assessing the impact of various parameters on predictive efficacy. The experimental results validate that our proposed model is capable of effectively predicting protein-protein interactions across diverse biological species.

Full Text