Abstract

Study question
Deep-learning algorithms are known to be non-robust: can the variability and inconsistency of AI algorithms used for embryo selection be reduced?

Summary answer
We reduced the variability of the algorithms (measured under input modifications such as rotations and brightness changes) by 86% while preserving their predictive quality.

What is known already
Deep-learning methods are generally known to be non-robust, i.e., their decisions can change with even a slight modification of the input data. Current embryo-scoring solutions are not robust; for example, rotating the input image results in a different score in most solutions on the market. Despite this fact and the concerns expressed by embryologists, there are no other publications focusing on the problem of variance in AI solutions used in IVF. Most publications report accuracy, sensitivity, specificity, and ROC AUC; none report variance metrics.

Study design, size, duration
The dataset was collected in multiple clinics using various devices. It contains 34,821 embryos (4,510 of which were transferred with known pregnancy outcomes), represented by time-lapse videos or images, giving 3,290,481 frames of embryos at various maturity levels. From this dataset, 925 randomly selected embryos were set aside as a test set. The test frames were modified with transformations that are not supposed to change the algorithm's output, and we measured the variability of the scores given by our algorithm.

Participants/materials, setting, methods
We considered seven modifications of images that should not influence embryo scoring:
• Rotations (10 different angles);
• Brightness and contrast modifications;
• Substitution of frames (frames from time-lapse monitoring taken within a 2-hour interval);
• Blur (generalised normal filter);
• Gaussian noise;
• Gaussian blur;
• Sharpening.
We used several techniques to reduce the variance of our deep neural network model (an architecture commonly used for embryo selection):
• Ensembling (of different models trained in cross-validation);
• Test-time augmentation (TTA; see the sketch after this list);
• Robust training.
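As an illustration of test-time augmentation, the following minimal sketch averages a model's score over rotated and brightness-adjusted copies of an embryo frame. It assumes a PyTorch-style model that returns a single scalar score and uses torchvision transforms; the function names, angles, and factors are hypothetical and do not come from the EMBROAID code base.

    import torch
    import torchvision.transforms.functional as TF

    def tta_score(model, image,
                  angles=(0, 36, 72, 108, 144, 180, 216, 252, 288, 324),
                  brightness_factors=(0.9, 1.0, 1.1)):
        """Average the score over label-preserving modifications of one frame,
        so the final prediction is less sensitive to any single modification."""
        model.eval()
        scores = []
        with torch.no_grad():
            for angle in angles:
                rotated = TF.rotate(image, angle)  # image: (C, H, W) tensor
                for factor in brightness_factors:
                    augmented = TF.adjust_brightness(rotated, factor)
                    scores.append(model(augmented.unsqueeze(0)).item())  # add batch dim
        return sum(scores) / len(scores)

The same pool of modifications can also be applied during training (robust training), so that the model itself becomes less sensitive to them.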
Main results and the role of chance
In order to measure the variance we used the following method. First, the scores are mapped to the standard uniform distribution; in other words, each score is replaced by the percentile in which it lies. This normalises the range of the scores so that variances can be compared. Second, we train the EMBROAID model on augmented data that includes all the above modifications. Third, we compute the variance of the normalised scores on the test set. The mean variance across all measured input modifications dropped by 86% (0.0055 to 0.0008). The individual drops in variance per input modification were: rotations, 77% (0.009 to 0.002); brightness and contrast, 81% (0.0036 to 0.0007); substitution of frames, 76% (0.0076 to 0.0019); blur, 94% (0.012 to 0.0008); Gaussian noise, 96% (0.0049 to 0.0002); Gaussian blur, 95% (0.0052 to 0.0003); sharpening, 77% (0.0015 to 0.0003). Significance was tested with the Wilcoxon rank-sum test, giving p < 0.01 for all input modifications. Finally, we stress that these results were obtained without any loss in the ROC AUC metric: we tested the algorithms both on the original and on the modified test set, and both models achieved an ROC AUC of 0.66 (CI 0.63-0.69) on both test sets.

Limitations, reasons for caution
Further work needs to be done to extend the set of possible data augmentations.

Wider implications of the findings
Increased reliability of AI scoring algorithms for embryo selection. It is possible to obtain consistent results over a wide range of data modifications.

Trial registration number
Not applicable
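As an illustration of the evaluation protocol described under "Main results and the role of chance", the sketch below shows how scores could be mapped to percentiles, how the per-embryo variance over the modified copies could be computed, and how two models could be compared with the Wilcoxon rank-sum test. It uses NumPy and SciPy; the variable names and the choice of reference scores for the percentile mapping are assumptions, not the authors' implementation.

    import numpy as np
    from scipy.stats import ranksums

    def to_percentile(scores, reference_scores):
        """Map raw scores to the standard uniform distribution: each score is
        replaced by the fraction of reference scores lying below it."""
        reference = np.sort(np.asarray(reference_scores))
        return np.searchsorted(reference, scores) / len(reference)

    def per_embryo_variance(scores, reference_scores):
        """Variance of percentile-normalised scores per embryo.
        `scores` has one row per embryo and one column per input modification."""
        normalised = to_percentile(np.asarray(scores), reference_scores)
        return normalised.var(axis=1)

    # Hypothetical usage: compare a baseline model with a robustly trained one.
    # Both score arrays have shape (n_embryos, n_modifications).
    # var_baseline = per_embryo_variance(baseline_scores, reference_scores)
    # var_robust   = per_embryo_variance(robust_scores, reference_scores)
    # stat, p_value = ranksums(var_baseline, var_robust)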