Numerous studies have determined a relationship between traffic violations and crash risk. However, not only have the results been inconsistent, but the effects of minor traffic violations have not been studied extensively. In addition, the characteristics of violation data (spatial autocorrelation) are widely ignored. Hence, various types of traffic violations and spatial variable selection methods (stepwise regression, adaptive LASSO, random forest, and Boruta) were employed to select key violation variables that could be used to predict crash frequency. Then, the study used Geographically Weighted Negative Binomial Regression (GWNBR) model to determine their varying associations with crash frequency. The results indicated that Boruta had the lowest error in variable selection, and GWNBR was the best-fitting model with a heterogeneous association between crash frequency and traffic violations. The study found that areas with a higher percentage of specific behaviour-related violations and other minor violations (illegal parking) were related to higher crash frequency.
Read full abstract