Accurate diagnosis of acute tubulointerstitial nephritis (AIN) often requires a kidney biopsy. We previously developed a diagnostic statistical model for predicting biopsy-confirmed AIN by combining four laboratory tests after evaluating over 150 potential predictors from the electronic health record. Here, we validate this diagnostic model in two biopsy-based cohorts at Johns Hopkins Hospital (JHH) and Yale, which were geographically and temporally distinct from the development cohort, respectively. We analyzed patients who underwent kidney biopsy at JHH and Yale University (2019-2023). We assessed discrimination (AUC) and calibration using previously derived model coefficients and recalibrated the model using an intercept correction factor that accounted for differences in baseline prevalence of AIN between development and validation cohorts. We included 1982 participants: 1454 at JHH and 528 at Yale. JHH (5%) and Yale (17%) had lower proportions of biopsies with AIN than the development set (23%). The AUC was 0.73 (0.66-0.79) at JHH and 0.73 (0.67-0.78) at Yale, similar to the development set (0.73 (0.64-0.81)). Calibration was imperfect in validation cohorts, particularly at JHH, but improved with application of an intercept correction factor. The model increased AUC of clinicians' prebiopsy suspicion for AIN by 0.10 to 0.77 (0.71-0.82). An AIN diagnostic model retained discrimination in two validation cohorts but needed recalibration to account for local AIN prevalence. The model improved clinicians' ability to predict AIN.