The application of deep learning technology in the field of materials science provides a new method for predicting the adsorption energy of high-performance alloy catalysts in hydrogen evolution reactions and material discovery. The activity and selectivity of catalytic materials are mainly influenced by the properties and positions of active sites and adsorption sites. However, current deep learning models have not sufficiently focused on the importance of active atoms and adsorbates, instead placing more emphasis on the overall structure of the catalytic materials. In this paper, the overall molecular graph and a masked graph, which ignores fixed atoms, are separately input into the Masked Graph Transformer (MGT) network to enhance the model's ability to recognize key sites in catalytic reactions. Second, we introduce a nonlinear message-passing mechanism to improve the dot-product attention in the Transformer and capture the directional information on the relative positions of nodes by integrating molecular geometric information through deep tensor products. Subsequently, we constructed the NLMP-TransNet framework, which combines MPNN and Transformer and optimizes the model's learning and prediction capabilities through weight sharing and residual connections. The MGT achieves an error rate of 0.5447 eV on the small data set OC20-Ni, surpassing existing technologies. Ablation studies confirm the necessity of focusing on site features for accurate adsorption energy prediction. Code is available at https://github.com/KristinSun/OCP-MGT.git.
Read full abstract