Study questionTo construct prediction models based on the Bayesian network (BN) learning method for the probability of fertilization failure (including low fertilization rate [LRF] and total fertilization failure [TFF]) in assisted reproductive technology (ART) treatment.Summary answerA BN model was developed to predict TFF/LFR. The model showed relatively high calibration in external validation, which could facilitate the identification of risk factors for fertilization disorders and improve the efficiency of in vitro fertilization/intracytoplasmic sperm injection (IVF/ICSI) treatment.What is known alreadyThe prediction of TFF/LFR is very complex. Although some studies attempted to construct prediction models for TFF/LRF, most of the reported models were based on limited variables and traditional regression-based models, which are unsuitable for analyzing real-world clinical data. Therefore, none of the reported models have been widely used in routine clinical practice. To date, BN modeling analysis is a prominent and increasingly popular machine learning method that is powerful in dealing with dynamic and complex real-world data.Study design, size, durationA retrospective study was performed with 106,640 fresh embryo IVF/ICSI cycles from 2009 to 2019 in one of China's largest reproductive health centers.Participants/materials, setting, methodsA total of 106, 640 cycles were included in this study, including 97,102 controls, 4,339 LFR cases, and 5,199 TFF cases. Twenty-four predictors were initially included, including 13 female-related variables, five male-related variables, and six variables related to IVF/ICSI treatment. BN modeling analysis with tenfold cross-validation was performed to construct the predictive model for TFF/LFR. The receiver operating characteristic (ROC) curves and the corresponding area under the curves (AUCs) were used to evaluate the performance of the BN model.Main results and the role of chanceAll twenty-four predictors were first organized into seven hierarchical layers in a theoretical BN model, according to prior knowledge from previous literature and clinical practice. A machine-learning BN model was generated based on real-world clinical data, containing a total of eighteen predictors, of which the infertility type, ART method, and number of retrieved oocytes directly influence the probabilities of LFR/TFF. The prediction accuracy of the BN model was 91.7%. The AUC of the TFF versus control groups was 0.779 (95% CI: 0.766-0.791), with a sensitivity of 71.2% and specificity of 70.1%; the AUC of of TFF versus LFR groups was 0.807 (95% CI: 0.790-0.824), with a sensitivity of 49.0% and specificity of 99.0%.Limitations, reason for cautionFirst, our study was based on clinical data from a single center, and the results of this study should be further verified by external data. In addition, some critical data (e.g., the detailed IVF laboratory parameters of the sperm and oocytes used for insemination) were not available in this study, which should be given full consideration when further improving the performance of the BN model.Wider implications of the findingsBased on extensive clinical real-world data, we developed a BN model to predict the probabilities of fertilization failures in ART, which provides new clues for clinical decision-making support for clinicians in formulating personalized treatment plans and further improving ART treatment outcomes.Study funding/competing interest(s)Dr. Y. Wang was supported by grants from the Beijing Municipal Science & Technology Commission (Z191100006619086). We declare that there are no conflicts of interest.Trial registration numberN/A.