Backgrounddiabetices foot ulcer (DFU) are serious complications. It is crucial to detect and diagnose DFU early in order to provide timely treatment, improve patient quality of life, and avoid the social and economic consequences. Machine learning techniques can help identify risk factors associated with DFU development. ObjectiveThe aim of this study was to establish correlations between clinical and biochemical risk factors of DFU through local foot examinations based on the construction of predictive models using automated machine learning techniques. MethodsThe input dataset consisted of 566 diabetes cases and 50 DFU risk factors, including 9 local foot examinations. 340 patients with Class 0 labeling (low-risk DFU), 226 patients with Class 1 labeling (high-risk DFU). To divide the training group (consisting of 453 cases) and the validation group (consisting of 113 cases), as well as preprocess the data and develop a prediction model, a Monte Carlo cross-validation approach was employed. Furthermore, potential high-risk factors were analyzed using various algorithms, including Bayesian BYS, Multi-Gaussian Weighted Classifier (MGWC), Support Vector Machine (SVM), and Random Forest Classifier (RF). A three-layer machine learning training was constructed, and model performance was estimated using a Confusion Matrix. The top 30 ranking feature variables were ultimately determined. To reinforce the robustness and generalizability of the predictive model, an independent dataset comprising 248 cases was employed for external validation. This validation process evaluated the model's applicability and reliability across diverse populations and clinical settings. Importantly, the external dataset required no additional tuning or adjustment of parameters, enabling an unbiased assessment of the model's generalizability and its capacity to predict the risk of DFU. ResultsThe ensemble learning method outperformed individual classifiers in various performance evaluation metrics. Based on the ROC analysis, the AUC of the AutoML model for assessing diabetic foot risk was 88.48 % (74.44–97.83 %). Other results were found to be as follows: 87.23 % (63.33 %–100.00 %) for sensitivity, 87.43 % (70.00 %–100.00 %) for specificity, 87.33 % (76.66 %–95.00 %) for accuracy, 87.69 % (75.00 %–100.00 %) for positive predictive value, and 87.70 % (71.79 %–100.00 %) for negative predictive value. In addition to traditional DFU risk factors such as cardiovascular disorders, peripheral artery disease, and neurological damage, we identified new risk factors such as lower limb varicose veins, history of cerebral infarction, blood urea nitrogen, GFR (Glomerular Filtration Rate), and type of diabetes that may be related to the development of DFU. In the external validation set of 158 samples, originating from an initial 248 with exclusions due to missing labels or features, the model still exhibited strong predictive accuracy. The AUC score of 0.762 indicated a strong discriminatory capability of the model. Furthermore, the Sensitivity and Specificity values provided insights into the model's ability to correctly identify both DFU cases and non-cases, respectively. ConclusionThe predictive model, developed through AutoML and grounded in local foot examinations, has proven to be a robust and practical instrument for the screening, prediction, and diagnosis of DFU risk. This model not only aids medical practitioners in the identification of potential DFU cases but also plays a pivotal role in mitigating the progression towards adverse outcomes. And the recent successful external validation of our DFU risk prediction model marks a crucial advancement, indicating its readiness for clinical application. This validation reinforces the model's efficacy as an accessible and reliable tool for early DFU risk assessment, thereby facilitating prompt intervention strategies and enhancing overall patient outcomes.