Abstract Study question Is it possible to predict clinical pregnancy using Machine Learning models on day 6 after embryo transfer to an acceptable degree of certainty? Summary answer Using selected biochemical markers it is possible to build a machine learning model to predict clinical pregnancy. What is known already Human chorionic gonadotropin (β-hCG) is considered the main early predictor of pregnancy. Due to the fact that the highest percentage of spontaneous abortions occurs in the first trimester predicting clinical pregnancy outcomes at the early stages of pregnancy is particularly difficult. The level of B-hcg measured at the early stages of implantation and the dynamics of its increase can help predict clinical pregnancy and predict its further development. However, the specific cut-off points at which pregnancy can be determined vary from study to study, and a different factor is often suggested to increase the certainty of β-hCG-based predictions. Study design, size, duration 1474 single frozen embryo transfers (1205 patients between 23 and 47 years of age), January 2019 -December 2022 were used for data modelling and statistical testing. The dataset consisted of endocrinal markers:β-hCG, PRG (progesterone), E (estradiol) measured on the 6th day after ET and other practitioner-chosen data: age, PGD results, embryo quality status, RIF or RPL history, embryo maturation days as well as pregnancy outcomes. Exclusion criteria were: PRG > 50ng/ml, β-hCG >70 mIU/ml. Participants/materials, setting, methods A machine learning model for predicting clinical pregnancy was trained using the gradient boosting technique. The model used 100 decision trees with 8 leaves, maximal depth of 2 nodes and a learning rate of 0.1. The performance of the model was measured using the accuracy score metric and area under ROC (AUC). Feature importance was based on Shapley values. Additional analysis was performed using statistical testing using a two-sided Student’s t-test. Main results and the role of chance A Machine Learning naive model employed chosen variables to predict clinical pregnancy. The most important variables turned out to be the serum level of β-hCG and PRG on day 6 after embryo-transfer. The final model, limited to those variables, performs with an accuracy of around 83 % (accuracy score = 0.834), the area under ROC (AUC) was 0.927. Based on Shapley values, β-hCG followed by PRG are the most relevant for the model’s predictions. Low PRG concentrations increased the chances of clinical pregnancy. The findings were confirmed by additional statistical analysis. Observations were divided into two groups for which the cut-off point was the level of PRG of 24 ng/ml – the median for the whole group. The probability of clinical pregnancy is higher in the group where the serum PRG is below 24 ng/ml even if β-hCG levels are the same. A statistically significant difference between these groups regarding β-hCG (P value < 0.001) was noticed. In order for serum β-hCG to be used as a more accurate clinical pregnancy predictor its level should be interpreted in the context of PRG serum levels. Limitations, reasons for caution Since in the dataset, only data regarding frozen embryo transfers were included, fresh ETs need to be explored further. The model was based on endocrinal markers as measured on day six after ET hence it cannot be used to interpret those markers as measured on any other date. Wider implications of the findings In order to most accurately inform patients regarding the possible outcomes of embryo-transfers, practitioners should not limit themselves to monitoring β-hCG serum levels but also include PRG as an important prognostic of clinical pregnancy. Trial registration number Not Applicable
Read full abstract