Abstract

Crash severity is one of the most widely studied topics in traffic safety area. Scholars have studied crash severity through various types of models. Using the publicly available 2017 Maryland crash data from the Department of Maryland State Police, the authors develop a multinomial logit (MNL) model and a random forest (RF) model, which belong to discrete choice and tree-based models, respectively, to (1) identify factors contributing to crash severity and (2) compare prediction performances and interpretation abilities between the two models. Based on the model results, major contributing factors of crash severity are identified, including collision type, occupant age, and speed limit. For the given dataset, RF has a higher prediction accuracy than MNL based on multiple measures (precision, recall, and F1 score), even though the differences are not dramatic. Sensitivity analysis results show that RF is less sensitive than MNL. RF can automatically capture the non-linear effects of continuous variables and reduce the influence of collinearity relationships existing among explanatory variables. This study shows the possibility of conducting sensitivity analysis to enhance understanding of MNL and RF results, and uncovers unique characteristics of the discrete choice and tree-based models.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.