ObjectiveTo develop a predictive model of incidence of traumatic spinal cord injury (TSCI). MethodsThe data for training the model included both the incidence data and the covariates. The incidence data were extracted from systematic reviews and the covariates were extracted from data available in the international road federation database. Then the feature processing measures were taken. First we defined a hyper-parameter, missing-value threshold, in order to eliminate features that exceed this threshold. To tackle the problem of overfitting of model we determined the Pearson correlation of features and excluded those with more than 0.7 correlation. After feature selection three different models including simple linear regression, support vector regression, and multi-layer perceptron were examined to fit the purposes of this study. Finally, we evaluated the model based on three standard metrics: Mean Absolute Error, Root Mean Square Error, and R2. ResultsOur machine-learning based model could predict the incidence rate of TSCI with the mean absolute error of 4.66. Our model found “Vehicles in use, Total vehicles/Km of roads”, “Injury accidents/100 Million Veh-Km”, “Vehicles in use, Vans, Pick-ups, Lorries, Road Tractors”, “Inland surface Passengers Transport (Mio Passenger-Km), Rail”, and "% paved” as top predictors of transport-related TSCI (TRTSCI). ConclusionsOur model is proved to have a high accuracy to predict the incidence rate of TSCI for countries, especially where the main etiology of TSCI is related to road traffic injuries. Using this model, we can help the policymakers for resource allocation and evaluation of preventive measures.
Read full abstract