Abstract Study question Could chatbots potentially be used as tools for both patients’ and physicians’ education in an infertility journey? Summary answer Artificial Intelligence (AI) is not yet in a position to give clear, evidence-based recommendations in the field of fertility, particularly concerning embryo transfer. What is known already The rapid development of AI has raised questions about its potential uses in different sectors of everyday life. Individuals from diverse fields have tried to incorporate its usage in their personal but also in their professional lives, with varying levels of success. Specifically in medicine, ChatGPT has been studied intensely with more than 400 related articles having been indexed in PubMed until May 2023, while AI has reportedly also succeeded in several standardized medical tests and passed several Medical Board examinations. Hence the question arises whether chatbots could be used as tools for clinical decision-making or patients’ and physicians’ education. Study design, size, duration We used nine of the most popular free AI chatbots available (ChatGPT, Bard, Writesonic, You, Perplexity, Learnt, Bing, Magickpen, and Rytr) and entered the following command: “Write me a 300-word scientific essay about evidence-based methods that can improve the outcomes of embryo transfer” in May 2023. We collected the responses and extracted the methods each chatbot suggested. When sufficient similarity among answers was present we categorized the answers under one category to facilitate the study. Participants/materials, setting, methods We calculated descriptive statistics and the prevalence of each response. We used as a comparator for widely acceptable practices that are proven to improve embryo transfer outcomes the 2017 ASRM guideline on performing embryo transfer taking also into consideration more current literature. Data was compared using chi-squared tests and a level of p < 0.05 was used as the threshold of statistical significance. Main results and the role of chance The range of the recommendations was from one to nine per chatbot (median = 4, IQR = 2) with an average of 4.78 suggestions per chatbot. Out of a total of 43 recommendations, which could be grouped into 19 similar categories, only 3/19 (15.8%) were evidence-based practices, those being “ultrasound-guided embryo transfer”, which was also the most commonly appearing response, appearing in 7/9 (77.8%) chatbots, “single embryo transfer” appearing in 4/9 (44.4%) chatbots and “use of a soft catheter” in 2/9 (22.2%) chatbots. Some controversial responses appeared even more often than the two latter evidence-based ones with “preimplantation genetic testing (PGT)” being the second most common response with 6/9 chatbots suggesting it (66.7%) and the vague suggestion of “optimal endometrium preparation” being in the third place with 5/9 chatbots (55.6%), both non-evidence-based practices to improve embryo transfer. The majority of the recommendations were unique, with 10 answers appearing only once (1/9; 11.1%), those being “endometrial scratching”, “natural cycle embryo transfer”, “use of GnRH antagonist protocol”, “blastocyst transfer”, “use of specialized catheter”, “preimplantation genetic screening (PGS)”, “mock embryo transfer”, “uninterrupted embryo culture”, “minimizing transfer time”, and “maintaining temperature and pH of the culture media”. Limitations, reasons for caution Firstly, we only used some of the most popular chatbots and not all existing ones. Additionally, chatbots tend to give different answers when repeatedly asked the same questions. However, we believe that our study effectively captures the prevailing practices of chatbot users, who typically pose a question only once. Wider implications of the findings Both patients and physicians should be wary of guiding care based on chatbot recommendations in infertility since the majority of responses consists of scientifically unsupported recommendations. Chatbot results might improve with time especially if trained to obtain information from validated medical databases, however, this will have to be verified scientifically. Trial registration number N/A
Read full abstract