BackgroundMachine learning (ML) has arrived in medicine to deliver individually adapted medical care. This study sought to use ML to discriminate stent restenosis (SR) compared with existing predictive scores of SR. To develop an easily applicable model, we performed our predictions without any additional variables other than those obtained in daily practice. MethodsThe dataset, obtained from the Grupo de Análisis de la Cardiopatía Isquémica Aguda (GRACIA)-3 trial, consisted of 263 patients with demographic, clinical, and angiographic characteristics; 23 (9%) of them presented with SR at 12 months after stent implantation. A methodology to work with small imbalanced datasets, based in cross-validation and the precision/recall (PR) plots, was used, and state-of-the-art ML classifiers were trained. ResultsOur best performing model (0.46, area under the PR curve [AUC-PR]) was developed with an extremely randomized trees classifier, which showed better performance than chance alone (0.09 AUC-PR, corresponding to the 9% of patients presenting SR in our dataset) and 3 existing scores; Prevention of Restenosis With Tranilast and its Outcomes (PRESTO)-1 (0.31 AUC-PR), PRESTO-2 (0.27 AUC-PR), and Evaluation of Drug-Eluting Stents and Ischemic Events (EVENT) (0.18 AUC-PR). The most important variables ranked according to their contribution to the predictions were diabetes, ≥2 vessel-coronary disease, post-percutaneous coronary intervention thrombolysis in myocardial infarction (PCI TIMI)-flow, abnormal platelets, post-PCI thrombus, and abnormal cholesterol. To counteract the lack of external validation for our study, we deployed our ML algorithm in an open source calculator, in which the model would stratify patients of high and low risk as an example tool to determine generalizability of prediction models from small imbalanced sample size. ConclusionsApplied immediately after stent implantation, a ML model better differentiates those patients who will present with SR over current discriminators.