STUDY QUESTIONCan the accuracy of timing of luteal phase endometrial biopsies based on urinary ovulation testing be improved by measuring the expression of a small number of genes and a continuous, non-categorical modelling approach?SUMMARY ANSWERMeasuring the expression levels of six genes (IL2RB, IGFBP1, CXCL14, DPP4, GPX3 and SLC15A2) is sufficient to obtain substantially more accurate timing estimates and to assess the reliability of timing estimates for each sample.WHAT IS KNOWN ALREADYCommercially available endometrial timing approaches based on gene expression require large gene sets and use a categorical approach that classifies samples as pre-receptive, receptive or post-receptive.STUDY DESIGN, SIZE, DURATIONGene expression was measured by RTq-PCR in different sample sets, comprising a total of 664 endometrial biopsies obtained 4–12 days after a self-reported positive home ovulation test. A further 36 endometrial samples were profiled by RTq-PCR as well as RNA-sequencing.PARTICIPANTS/MATERIALS, SETTING, METHODSA computational procedure, named ‘EndoTime’, was established that models the temporal profile of each gene and estimates the timing of each sample. Iterating these steps, temporal profiles are gradually refined as sample timings are being updated, and confidence in timing estimates is increased. After convergence, the method reports updated timing estimates for each sample while preserving the overall distribution of time points.MAIN RESULTS AND THE ROLE OF CHANCEThe Wilcoxon rank-sum test was used to confirm that ordering samples by EndoTime estimates yields sharper temporal expression profiles for held-out genes (not used when determining sample timings) than ordering the same expression values by patient-reported times (GPX3: P < 0.005; CXCL14: P < 2.7e−6; DPP4: P < 3.7e−13). Pearson correlation between EndoTime estimates for the same sample set but based on RTq-PCR or RNA-sequencing data showed a high degree of congruency between the two (P = 8.6e−10, R2 = 0.687). Estimated timings did not differ significantly between control subjects and patients with recurrent pregnancy loss or recurrent implantation failure (P > 0.05).LARGE SCALE DATAThe RTq-PCR data files are available via the GitHub repository for the EndoTime software at https://github.com/AE-Mitchell/EndoTime, as is the code used for pre-processing of RTq-PCR data. The RNA-sequencing data are available on GEO (accession GSE180485).LIMITATIONS, REASONS FOR CAUTIONTiming estimates are informed by glandular gene expression and will only represent the temporal state of other endometrial cell types if in synchrony with the epithelium. Methods that estimate the day of ovulation are still required as these data are essential inputs in our method. Our approach, in its current iteration, performs batch correction such that larger sample batches impart greater accuracy to timing estimations. In theory, our method requires endometrial samples obtained at different days in the luteal phase. In practice, however, this is not a concern as timings based on urinary ovulation testing are associated with a sufficient level of noise to ensure that a variety of time points will be sampled.WIDER IMPLICATIONS OF THE FINDINGSOur method is the first to assay the temporal state of luteal-phase endometrial samples on a continuous domain. It is freely available with fully shared data and open-source software. EndoTime enables accurate temporal profiling of any gene in luteal endometrial samples for a wide range of research applications and, potentially, clinical use.STUDY FUNDING/COMPETING INTEREST(S)This study was supported by a Wellcome Trust Investigator Award (Grant/Award Number: 212233/Z/18/Z) and the Tommy's National Miscarriage Research Centre. None of the authors have any competing interests. J.L. was funded by the Biotechnology and Biological Sciences Research Council (UK) through the Midlands Integrative Biology Training Partnership (MIBTP, BB/M01116X/1).
Read full abstract