PurposeTo evaluate the diagnostic performance of an Artificial Intelligence (AI) algorithm, previously trained using both adult and pediatric patients, for the detection of acute appendicular fractures in the pediatric population on conventional X-ray radiography (CXR). Materials and methodsIn this retrospective study, anonymized extremities CXRs of pediatric patients (age <17 years), with or without fractures, were included. Six hundred CXRs (maintaining the positive-for-fracture and negative-for-fracture balance) were included, grouping them per body part (shoulder/clavicle, elbow/upper arm, hand/wrist, leg/knee, foot/ankle). Follow-up CXRs and/or second-level imaging were considered as reference standard. A deep learning algorithm interpreted CXRs for fracture detection on a per-patient, per-radiograph, and per-location level, and its diagnostic performance values were compared with the reference standard. AI diagnostic performance was computed by using cross-tables, and 95 % confidence intervals [CIs] were obtained by bootstrapping. ResultsThe final cohort included 312 male and 288 female with a mean age of 8.9±4.5 years. Three undred CXRs (50 %) were positive for fractures, according to the reference standard. For all fractures, the AI tool showed a per-patient 91.3% (95%CIs = 87.6–94.3) sensitivity, 76.7% (71.5-81.3) specificity, and 84% (82.1–86.0) accuracy. In the per-radiograph analysis the AI tool showed 85% (81.9–87.8) sensitivity, 88.5% (86.3–90.4) specificity, and 87.2% (85.7–89.6) accuracy. In the per-location analysis, the AI tool identified 606 bounding boxes: 472 (77.9 %) were correct, 110 (18.1 %) incorrect, and 24 (4.0 %) were not-overlapping. ConclusionThe AI algorithm provides good overall diagnostic performance for detecting appendicular fractures in pediatric patients.