Abstract

In recent years, increased attention has been given to understanding the spatial pattern of crashes in urban areas. Accurately capturing the spatial relationship between crash counts and variables requires extracting hidden information from multiple data sources. In this study, we propose a machine learning model to explore the spatial impact of activity patterns on spatially aggregated crash counts. Our paper introduces a two-step framework: (a) the Latent Dirichlet Allocation (LDA) model, an unsupervised method for mining hidden activity patterns from floating vehicle trajectory data, and (b) the Graph Convolutional Network (GCN) model, which builds the spatial relationship between multi-source data. The data and hidden activity patterns were aggregated into 175 Traffic Analysis Zones (TAZs) in San Francisco using spatial partitioning. The GCN model provided higher prediction accuracy than commonly used machine learning algorithms that did not consider combined spatial relationships and those that only considered traditional vehicle counts data. Furthermore, we used attribution algorithms to obtain the respective weight scores of each factor. Our results reveal that daily vehicle kilometers traveled, road density, population density, commercial activity during weekends, and residential activity during morning peak hours on weekdays are factors associated with crashes.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call