Abstract

The nucleophilic substitution reactions of carboxylic esters (R1COOR2) follow 2 possible mechanisms, namely, stepwise mechanism and concerted mechanism. The reaction mechanism is affected by the structures of the nonleaving group (NLG, ie, R1), the leaving group (LG, ie, OR2) of carboxylic esters, and the nucleophiles (Nu). The traditional approach for the determination of reaction mechanism by dynamic experiments is not only laborious and time consuming but also only applicable for the situation with single structural factor variation. Benefiting from the ample existing kinetic data, this article aimed to construct a molecular structure-based classification model by support vector machine (SVM). The polarizability effect index of the NLG (PEI(NLG)), the pKa of the conjugate acid of the LG (pKa(LG)), and the pKa of the conjugate acid of the Nu (pKa(Nu)) were used to characterize the structural information of the 3 influencing factors, respectively. On the basis of these structural descriptors, the SVM classification model was established for the reaction mechanisms of 225 nucleophilic substitution reactions in the training set. The total prediction accuracy of 99.11% was achieved by 5-fold cross validation. Then the reaction mechanisms of 56 reactions in test set A were predicted by this SVM model, and the prediction accuracy was up to 98.21%. Furthermore, test set B consisting of 24 reactions were used as another external data set for prediction, where the reaction mechanisms were controversially reported by different studies. The prediction results of test set B by the SVM model built in this article were amazingly consistent with the conclusions drawn by Um et al.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call