ObjectivesTo construct and externally validate machine learning-based nomograms for predicting progression stages of initial prostate cancer (PCa) using biomarkers and clinicopathologic features. Methods362 inpatients diagnosed with PCa at the First Affiliated Hospital were randomly assigned to training and testing sets in a 3:7 ratio, while 136 PCa patients from People's Hospital formed the external validation set. Imaging and clinicopathologic information were collected. Optimal features distinguishing advanced prostate cancer (APC) and metastatic PCa (mPCa) were identified through logistic regression (LR). ML algorithms were employed to build and compare ML models. The best-performing algorithm established models for PCa progression stage. Models performance was evaluated using metrics, ROC curves, calibration, and decision curve analysis (DCA) in training, testing, and external validation sets. ResultsFollowing LR analyses, PSA (P=0.001), maximum tumor diameter (P=0.026), Gleason score (P<0.001), and RNF41 (P<0.001) were optimal features for predicting APC, while ALP (P<0.001), PSA (P<0.001), and GS score (P=0.024) were for mPCa. Among ML models, the LR models exhibited superior performance. Consequently, the LR algorithm was used for the APC-risk-nomogram and mPCa-risk-nomogram construction, with AUC values of 0.848, 0.814, 0.810, and 0.940, 0.913, 0.910, in the training, testing, and external validation sets, respectively. Calibration and DCA curves affirmed nomograms' consistency and net benefits for clinical decision-making. ConclusionsIn summary, ML-based APC-risk-nomogram and mPCa-risk-nomogram exhibit outstanding predictive performance for PCa progression stages. These nomograms can assist clinicians in finely categorizing newly diagnosed PCa patients, facilitating personalized treatment plans and prognosis assessment.