Abstract
Crash prediction models are used extensively in highway safety analysis. This paper discusses a recently developed data-mining technique to predict motor vehicle crashes: the multivariate adaptive regression splines (MARS) technique. MARS shows promising predictive power and does not suffer from a black-box limitation. Negative binomial (NB) and MARS models were fitted and compared with the use of extensive data collected on unsignalized intersections in Florida. Two models were estimated for rear-end crash frequency at three-and four-legged unsignalized intersections. Treatment of crash frequency as a continuous response variable to fit a MARS model was also examined by normalizing crash frequency with the natural logarithm of the annual average daily traffic. The combination of MARS with a machine learning technique (random forest) was explored and discussed. The significant factors that affected rear-end crashes were traffic volume on the major road, upstream and downstream distances to the nearest signalized intersection, median type on the major approach, land use at the intersection's influence area, and geographic location within the state. The study showed that MARS could predict crashes almost like the traditional NB models, and its goodness-of-fit performance was encouraging. The use of MARS to predict continuous response variables yielded more favorable results than its use to predict discrete response variables. The generated MARS models showed the most promising results after the covariates were screened by using random forest.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: Transportation Research Record: Journal of the Transportation Research Board
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.