To the Editor: We were interested to read the article by Benton et al. (1) published in the October 2012 issue of J Strength Cond Res. The authors aimed to evaluate the effect of training status on differences between trials of 1 repetition maximum (1RM) measurement in women. They reported that in untrained women (UTW), differences between chest press trials were nonsignificant, although a significant difference between leg press trials (1,2,4) was observed. Moreover, in trained women (TRW), differences between all 1RM trials were nonsignificant for both the chest press and leg press (1). These results do not reflect reliability and validity of 1RM and is a common mistake in reliability analysis (2–5). We found the article title of Benton et al. (1) incorrect. As the authors point out in their conclusion, for trained middle-aged women, a single trial of 1RM chest or leg press may be sufficient to obtain an accurate and reliable measure of maximal strength. Such a conclusion is misleading because of not only inappropriate use of statistical tests, but also misinterpretation of their results (2–5). Reliability and validity are 2 completely different methodological issues in researches. Reliability (repeatability or reproducibility) that indicates precision of a test was assessed by different statistical tests, such as Pearson r, least square, and paired t-test, in which all of them were among common mistakes in reliability analysis (3). Briefly, for quantitative variable intraclass correlation coefficient exact and for qualitative variables, weighted kappa should be used with caution because kappa has its own limitation too (2,4,5). Regarding validity (accuracy), sensitivity, specificity, positive predictive value, negative predictive value, likelihood ratio positive (LR+: true positive/false positive), and likelihood ratio negative (LR−: false negative/true negative) and odds ratio (true results/false results; preferably more than 50) are among the tests to evaluate the validity (accuracy) of a single test compared with a gold standard. As a take home message, for reliability and validity analysis, appropriate tests should be applied.