IntroductionNowadays, a significant part of goods and passengers are transported on suburban highways with mainly high-speed vehicles. Hence, these highways are very prone to collisions with different injuries. For this reason, road collisions have become one of the largest international health issues in the world recently. Due to the high fatality or severe physical/mental injury rates caused by car collisions and the complex interactions between the factors affecting them, analyzing these collision-prone areas, identifying the factors affecting their occurrences, and discovering knowledge in the form of key rules is crucial, as the purposes for this study. MethodsThree supervised algorithms including Artificial Neural Network (ANN), Support Vector Machine (SVM), and Random Forests (RF) were used to build up classification models for the fatality severity of 2355 fatal collision data records during 2007–2009 occurred in the roadways of 8 states in the USA with different driver-related, environmental, and road factors. Predicted risk maps were generated for each classifier and the importance of contributing factors was evaluated based on the mean decrease in accuracy and the mean decrease in Gini Index. Finally, association rule mining was performed by the Apriori algorithm to extract collision rules. ResultsRF outperformed the other methods in terms of the highest overall accuracy and kappa rates, which were 94% and 92% respectively. The risk maps revealed that collisions with the most fatalities were mainly concentrated in the northern states. The number of travel lanes, speed limit, roadway profile, and light conditions were the most influencing factors in the collisions. 68 association rules were mined by the Apriori algorithm. Then, important rules were specified to extract hidden information from the collision data. ConclusionsThe RF model was more accurate in collision analysis, along with comprehensive collision factors. Great importance should be given to road factors in route design, especially in critical locations with severe fatality risk.
Read full abstract