MF-SuP-pKa: Multi-fidelity modeling with subgraph pooling mechanism for pKa prediction.

Jialu Wu,Chang-Yu Hsieh,Dongsheng Cao,Yue Wan,Shengyu Zhang,Tingjun Hou,Zhenxing Wu

doi:10.1016/j.apsb.2022.11.010

Abstract

Acid-base dissociation constant (pKa) is a key physicochemical parameter in chemical science, especially in organic synthesis and drug discovery. Current methodologies for pKa prediction still suffer from limited applicability domain and lack of chemical insight. Here we present MF-SuP-pKa (multi-fidelity modeling with subgraph pooling for pKa prediction), a novel pKa prediction model that utilizes subgraph pooling, multi-fidelity learning and data augmentation. In our model, a knowledge-aware subgraph pooling strategy was designed to capture the local and global environments around the ionization sites for micro-pKa prediction. To overcome the scarcity of accurate pKa data, low-fidelity data (computational pKa) was used to fit the high-fidelity data (experimental pKa) through transfer learning. The final MF-SuP-pKa model was constructed by pre-training on the augmented ChEMBL data set and fine-tuning on the DataWarrior data set. Extensive evaluation on the DataWarrior data set and three benchmark data sets shows that MF-SuP-pKa achieves superior performances to the state-of-the-art pKa prediction models while requires much less high-fidelity training data. Compared with Attentive FP, MF-SuP-pKa achieves 23.83% and 20.12% improvement in terms of mean absolute error (MAE) on the acidic and basic sets, respectively.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Acta Pharmaceutica Sinica B	Publication Date: Jun 1, 2023
Citations: 15	License type: cc-by-nc-nd

R Discovery Prime

MF-SuP-pKa: Multi-fidelity modeling with subgraph pooling mechanism for pKa prediction.

Abstract

Published Version

Talk to us

Similar Papers

More From: Acta Pharmaceutica Sinica B

Lead the way for us

Similar Papers

The effects of scale factor and correction on the multi-fidelity model
Seok-Ho Son ... Dong-Hoon Choi
Journal of Mechanical Science and Technology | VOL. 30
Seok-Ho Son, et. al.Seok-Ho Son ... Dong-Hoon Choi
01 May 2016
Journal of Mechanical Science and Technology | VOL. 30

Design approach for tilt propellers of UAM/eVTOLs for cruise and hover considering aerodynamic and aeroacoustic characteristics via a multi-fidelity model
Yingzhe Ye ... Kefu Huang
Aerospace Science and Technology | VOL. 156
Yingzhe Ye, et. al.Yingzhe Ye ... Kefu Huang
19 Nov 2024
Aerospace Science and Technology | VOL. 156

Graph transformer based transfer learning for aqueous pKa prediction of organic small molecules
Yuxin Qiu ... Zhen Song
Chemical Engineering Science | VOL. 300
Yuxin Qiu, et. al.Yuxin Qiu ... Zhen Song
31 Jul 2024
Chemical Engineering Science | VOL. 300

Multi-fidelity Data Aggregation using Convolutional Neural Networks
Jie Chen ... Yongming Liu
Computer Methods in Applied Mechanics and Engineering | VOL. 391
Jie Chen, et. al.Jie Chen ... Yongming Liu
12 Jan 2022
Computer Methods in Applied Mechanics and Engineering | VOL. 391

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

MF-SuP-pKa: Multi-fidelity modeling with subgraph pooling mechanism for pKa prediction.

Abstract

Published Version

Talk to us

Similar Papers

More From: Acta Pharmaceutica Sinica B