Abstract

Understanding the relationship between molecular structure and metabolic pathway classes is significant for optimizing drug metabolization. In bioinformatics, graph neural networks (GNNs) can effectively capture structural and semantic features for molecular representation. Graph neural networks have become an essential method to encode molecular structures for multi-label prediction of metabolic pathways. However, building a GNN model for a given molecular structure dataset requires the manual design of GNN structure and fine-tuning of the hyperparameters for the GNN model, which is time-consuming and relies on expert experience. In this paper, we design an automatic end-to-end molecular structure representation learning framework named Auto-MSR that can design a GNN model for molecular structure encoding with little manual intervention. For a given compound molecule structure dataset, Auto-MSR first uses an efficient pa rallel GNN structure search algorithm to identify the optimal GNN structure from the GNN structure subspace. Then, it adopts a tree-structured parzen estimator approach to obtain the best hyperparameters of the GNN model in the hyperparameters subspace. We test AutoMSR on the dataset KEGG based on the multi-label metabolic pathway prediction task. The comparing results show that AutoMSR outperforms state-of-art manual graph neural networks on performance.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call