Background
Variations in defining poor response to total knee arthroplasty (TKA) impede comparisons of response after TKA over time and across hospitals. This study aimed to compare the prevalence, overlap, and discriminative accuracy of 15 definitions of poor response after TKA using two databases.

Methods
Data of patients one year after primary TKA from the Dutch Arthroplasty Register (n = 12,275) and the Osteoarthritis Initiative database (n = 204) were used to examine the prevalence, overlap (estimated by Cohen’s kappa), and discriminative accuracy (sensitivity, specificity, positive predictive value, negative predictive value, and Youden index) of 15 different definitions of poor response after TKA. In the absence of a gold standard for measuring poor response to TKA, the numeric rating scale for satisfaction (≤ 6 ‘poor responder’) and the global assessment of knee impact (dichotomized: ≥ 4 ‘poor responder’) were used as anchors for assessing discriminative accuracy in the Dutch Arthroplasty Register and the Osteoarthritis Initiative dataset, respectively. These anchors were chosen based on a prior qualitative study in which patients and knee specialists identified (dis)satisfaction as a central theme of poor response to TKA.

Results
The median (25th to 75th percentile) prevalence of poor responders across the examined definitions was 18.5% (14.0 to 25.5%), and the median Cohen’s kappa for the overlap between pairs of definitions was 0.41 (0.32 to 0.59). Median (25th to 75th percentile) sensitivity was 0.45 (0.39 to 0.54), specificity was 0.86 (0.82 to 0.94), positive predictive value was 0.45 (0.34 to 0.62), negative predictive value was 0.89 (0.87 to 0.89), and the Youden index was 0.36 (0.20 to 0.43).

Conclusions
This study found a lack of overlap between different definitions of poor response to TKA. None of the examined definitions adequately classified poor responders to TKA. In contrast, the absence of a poor response could be classified with confidence.
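
The accuracy and overlap measures named in the Methods all derive from 2×2 cross-tabulations: one definition against the anchor for discriminative accuracy, and one definition against another for Cohen’s kappa. The Python sketch below illustrates the standard formulas only. It is not the authors' analysis code, the function and field names are hypothetical, and the counts are invented for illustration rather than taken from either database. It assumes every row and column total is non-zero.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TwoByTwo:
    """Cross-tabulation of one definition against the anchor (hypothetical names)."""
    tp: int  # definition: poor responder, anchor: poor responder
    fp: int  # definition: poor responder, anchor: not a poor responder
    fn: int  # definition: not a poor responder, anchor: poor responder
    tn: int  # definition: not a poor responder, anchor: not a poor responder


def accuracy_measures(t: TwoByTwo) -> dict:
    """Standard discriminative accuracy measures from a 2x2 table."""
    sensitivity = t.tp / (t.tp + t.fn)
    specificity = t.tn / (t.tn + t.fp)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": t.tp / (t.tp + t.fp),
        "npv": t.tn / (t.tn + t.fn),
        # Youden index: sensitivity + specificity - 1
        "youden": sensitivity + specificity - 1,
    }


def cohens_kappa(both_poor: int, only_a: int, only_b: int, neither: int) -> float:
    """Cohen's kappa for the agreement between two binary definitions A and B."""
    n = both_poor + only_a + only_b + neither
    observed = (both_poor + neither) / n
    a_poor = (both_poor + only_a) / n  # share labelled poor by definition A
    b_poor = (both_poor + only_b) / n  # share labelled poor by definition B
    # Agreement expected by chance if A and B were independent
    expected = a_poor * b_poor + (1 - a_poor) * (1 - b_poor)
    return (observed - expected) / (1 - expected)


# Invented counts for illustration only (not study data).
example = TwoByTwo(tp=45, fp=126, fn=55, tn=774)
for name, value in accuracy_measures(example).items():
    print(f"{name}: {value:.2f}")
print(f"kappa: {cohens_kappa(both_poor=40, only_a=60, only_b=60, neither=840):.2f}")
```

Because the positive and negative predictive values depend on how common poor response is under the anchor, a definition can show a high negative predictive value alongside a modest positive predictive value whenever poor responders are a minority.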