Abstract. The total aviation effective radiative forcing is dominated by non-CO2 effects. The largest contributors to the non-CO2 effects are contrails and contrail cirrus. There is the possibility of reducing the climate effect of aviation by avoiding flying through ice-supersaturated regions (ISSRs), where contrails can last for hours (so-called persistent contrails). Therefore, a precise prediction of the specific location and time of these regions is needed. But a prediction of the frequency and degree of ice supersaturation (ISS) on cruise altitudes is currently very challenging and associated with great uncertainties because of the strong variability in the water vapour field, the low number of humidity measurements at the air traffic altitude, and the oversimplified parameterisations of cloud physics in weather models. Since ISS is more common in some dynamical regimes than in others, the aim of this study is to find variables/proxies that are related to the formation of ISSRs and to use these in a regression method to predict persistent contrails. To find the best-suited proxies for regressions, we use various methods of information theory. These include the log-likelihood ratios, known from Bayes' theorem, a modified form of the Kullback–Leibler divergence, and mutual information. The variables (the relative humidity with respect to ice, RHiERA5; the temperature, T; the vertical velocity, ω; the divergence, DIV; the relative vorticity, ζ; the potential vorticity, PV; the normalised geopotential height, Z; and the local lapse rate, γ) come from ERA5, and RHiM/I, which we assume as the truth, comes from MOZAIC/IAGOS (Measurement of Ozone and Water Vapour on Airbus In-service Aircraft/In-service Aircraft for a Global Observing System; commercial aircraft measurements). It turns out that RHiERA5 is the most important predictor of ice supersaturation, in spite of its weaknesses, and all other variables do not help much to achieve better results. Without RHiERA5, a regression to predict ISSRs is not successful. Certain modifications of RHiERA5 before the regression (as suggested in recent papers) do not lead to improvements of ISSR prediction. Applying a sensitivity study with artificially modified RHiERA5 distributions points to the origin of the problems with the regression: the conditional distributions of RHiERA5 (conditioned on ISS and non-ISS, from RHiM/I) overlap too heavily in the range of 70 %–100 %, so for any case in that range, it is not clear whether it belongs to an ISSR or not. Evidently, this renders the prediction of contrail persistence very difficult.