A promising approach to optimizing recovery in youth football has been the use of machine learning (ML) models to predict recovery states and prevent mental fatigue. This research investigates the application of ML models in classifying male young football players aged under (U)15, U17, and U19 according to their recovery state. Weekly training load data were systematically monitored across three age groups throughout the initial month of the 2019-2020 competitive season, covering 18 training sessions and 120 observation instances. Outfield players were tracked using portable 18-Hz global positioning system (GPS) devices, while heart rate (HR) was measured using 1 Hz telemetry HR bands. The rating of perceived exertion (RPE 6-20) and total quality recovery (TQR 6-20) scores were employed to evaluate perceived exertion, internal training load, and recovery state, respectively. Data preprocessing involved handling missing values, normalization, and feature selection using correlation coefficients and a random forest (RF) classifier. Five ML algorithms [K-nearest neighbors (KNN), extreme gradient boosting (XGBoost), support vector machine (SVM), RF, and decision tree (DT)] were assessed for classification performance. The K-fold method was employed to cross-validate the ML outputs. A high accuracy for this ML classification model (73-100%) was verified. The feature selection highlighted critical variables, and we implemented the ML algorithms considering a panel of 9 variables (U15, U19, body mass, accelerations, decelerations, training weeks, sprint distance, and RPE). These features were included according to their percentage of importance (3-18%). The results were cross-validated with good accuracy across 5-fold (79%). The five ML models, in combination with weekly data, demonstrated the efficacy of wearable device-collected features as an efficient combination in predicting football players' recovery states.
Read full abstract