Abstract

Real-time crash risk prediction is a hot topic of emerging technology. Due to the lack of basic risk formation theory, previous studies focussed on the application of complex models to improve the accuracy of prediction, ignoring the interpretation of variables, while the traditional statistical analysis method can interpret variables, but the prediction accuracy is poor, which falls into a dilemma of trade-off. In this study, based on the traffic flow information of elevated expressway, an improved genetic programming (GP) approach with elite gene bank is applied to obtain an explicit traffic flow crash risk function to solve the above trade-off problem. Logistic regression and backward-propagation neural network combined with partial dependency plot were used as baseline methods to examine the interpretability and accuracy of GP. It is found that GP prediction model has been proved to be able to select important variables and solve the trade-off dilemma, which has good interpretability and accuracy. The results show that crash risk in the traffic flow mainly comes from the traffic volume, speed of the upstream section, and the speed of the current section. Furthermore, the error of GP comes from the unobserved heterogeneity and crash mechanism theory is proposed.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call