In the present issue, Mazzolari et al. (2022) present a persuasive case for the wider use of frequentist interval testing (for equivalence and non-inferiority) for interventional studies in exercise physiology and sport science. A persistent issue in the field is small, noisy, underpowered studies. With conventional null hypothesis significance tests for superiority, claims of no (zero) effect abound when P > 0.05. However, the confidence interval for the estimated effect would often show that biologically or clinically meaningful effects (both ‘positive’ and ‘negative’, benefit and harm) were compatible with the data and model and may not be ruled out statistically. If we fully embrace and discuss uncertainty in the estimation of intervention effects, then a more robust conclusion in such situations would be ‘get more data – more evidence needed’. The famous phrase ‘absence of evidence is not evidence of absence’ (Altman & Bland, 1995) articulates this problem elegantly and succinctly. Here, ‘absence of evidence’ means, conventionally, P > 0.05 for the mean intervention effect (intervention minus comparator). Since Altman and Bland's note, this issue has been revisited in many journal editorials in an attempt to improve the reporting of trial results (e.g., Alderson, 2004). Mazzolari et al. (2022) note that frequentist interval testing is applicable if the research question focuses on whether two interventions have similar efficacy (equivalence) or whether one intervention is not substantially worse than another (non-inferiority). These approaches are used frequently in biomedical research. For example, a new treatment might be developed that might cost less, be less burdensome encouraging greater compliance, or be associated with fewer adverse events than the accepted standard treatment. In such cases we might want to know whether the proposed new treatment worked as well as, or at least not substantially worse than, the standard treatment. The authors identify a range of potential applications of these approaches in the physiology and sport science research field. Frequentist interval testing provides the desired error control and may often be a better match to the research questions that researchers are really interested in answering – questions that conventional null hypothesis significance testing for superiority cannot address. So, how do physiology and sport science researchers apply these methods in their work? The article by Mazzolari et al. (2022) is a tour de force that covers all of the key issues, such as how to establish the equivalence or non-inferiority margins, and sample size planning for these designs. The article references provide an excellent reading list for those wishing to delve deeper and increase their understanding. At the very least, this contribution will increase awareness and facilitate improved critical reading of published research. Those wishing to apply the methods in the analysis of their own data will benefit from the worked examples, including the accompanying workbook with code for two popular statistics software packages. Anyone considering an equivalence or non-inferiority study should seek expert statistics input from the outset. This is sage advice for any study, but perhaps especially for these designs wherein there are many pitfalls for the unwary. Many, if not all, readers will be familiar with the Consolidated Standards of Reporting Trials (CONSORT) statement, providing a checklist for the complete and transparent reporting of parallel-arm (independent groups) randomized trials. An extension to this statement is available for equivalence and non-inferiority studies (Piaggio et al., 2012). In summary, this article is essential reading for consumers and doers of research in physiology and sport science, and I commend it to the journal readership. None.