AbstractPredicting short-term traffic crashes is challenging due to an imbalanced data set characterized by excessive zeros in noncrash counts, random crash occurrences, spatiotemporal correlation in crash counts, and inherent heterogeneity. Existing models struggle to effectively address these distinct characteristics in crash data. This paper proposes a new joint model by combining the time-series generalized regression neural network (TGRNN) model and the binomially weighted convolutional neural network (BWCNN) model. The joint model aims to capture all these characteristics in short-term crash prediction. The model was trained and tested using real-world, highly disaggregated traffic data collected with inductive loop detectors on the M1 motorway in the UK in 2019, along with crash data extracted from the UK National Accident Database for the same year. The short-term is defined as a 30-min interval, providing sufficient time for a traffic control center to implement interventions and mitigate potential hazards. The year was segmented into 30-min intervals, resulting in a highly imbalanced data set with over 99.99% noncrash samples. The joint model was applied to predict the probability of a crash occurrence by updating both the crash and traffic data every 30 min. The findings revealed that 75.3% of crashes and 81.6% of noncrash events were correctly predicted in the southbound direction. In the northbound direction, 78.1% of crashes and 80.2% of noncrash events were accurately captured. Causal analysis and model-based interpretation were used to analyze the relative importance of explanatory variables regarding their contribution to crashes. The results reveal that speed variance and speed are the most influential factors contributing to crash occurrence.