Discovering latent themes in traffic fatal crash narratives using text mining analytics and network topology

Keneth Morgan Kwayu,Valerian Kwigizile,Kevin Lee,Jun-Seok Oh

doi:10.1016/j.aap.2020.105899

Abstract

The proliferation of digital textual archives in the transportation safety domain makes it imperative for the inventions of efficient ways of extracting information from the textual data sources. The present study aims at utilizing crash narratives complemented by crash metadata to discern the prevalence and co-occurrence of themes that contribute to crash incidents. Ten years (2009–2018) of Michigan traffic fatal crash narratives were used as a case study. The structural topic modeling (STM) and network topology analysis were used to generate and examine the prevalence and interaction of themes from the crash narratives that were mainly categorized into pre-crash events, crash locations and involved parties in the traffic crashes. The main advantage of the STM over the other topic modeling approaches is that it allows the researchers to discover themes from documents and estimate how the topic relates to the document metadata. Topics with the highest prevalence for the angle, head-on, rear-end, sideswipe and single motor vehicle crashes were crash at stop-sign, crossing the centerline, unable to stop, lane change maneuver and run-off-road crash, respectively. Eigenvector centrality measure in network topology showed that event-related topics were consistently central in articulating the crash occurrence. The centrality and association between topics varied across crash types. The efficacy of generated topics in classifying crashes by type was tested using a machine learning algorithm, Random Forest. The classification accuracy in the held-out sample ranged between 89.3 % for sideswipe crashes to 99.2 % for single motor vehicle crashes. High classification accuracy suggests that automation of crash typing and consistency checks can be accomplished effectively by using extracted latent themes from the crash narratives.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Discovering latent themes in traffic fatal crash narratives using text mining analytics and network topology

Abstract

Talk to us

Similar Papers

More From: Accident Analysis and Prevention

Lead the way for us

Journal: Accident Analysis and Prevention	Publication Date: Dec 4, 2020
Citations: 37

Similar Papers

Comparison of Severity of Motorcyclist Injury by Crash Types
William H Schneider ... Peter T Savolainen
Transportation Research Record: Journal of the Transportation Research Board | VOL. 2265
William H Schneider, et. al.William H Schneider ... Peter T Savolainen
01 Jan 2010
Transportation Research Record: Journal of the Transportation Research Board | VOL. 2265

Traffic Safety on Acadia National Park Roadways
Xiao Xiao ... Mengqing Wang
Journal of Park and Recreation Administration | VOL. 37
Xiao Xiao, et. al.Xiao Xiao ... Mengqing Wang
01 Jan 2019
Journal of Park and Recreation Administration | VOL. 37

Modeling severities of motorcycle crashes using random parameters
Ahmed Farid ... Khaled Ksaibati
Journal of Traffic and Transportation Engineering (English Edition) | VOL. 8
Ahmed Farid, et. al.Ahmed Farid ... Khaled Ksaibati
24 Jun 2020
Journal of Traffic and Transportation Engineering (English Edition) | VOL. 8

Using Bidirectional Encoder Representations from Transformers (BERT) to classify traffic crash severity types
Amir Hossein Oliaee ... M Ashifur Rahman
Natural Language Processing Journal | VOL. 3
Amir Hossein Oliaee, et. al.Amir Hossein Oliaee ... M Ashifur Rahman
23 Apr 2023
Natural Language Processing Journal | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Discovering latent themes in traffic fatal crash narratives using text mining analytics and network topology

Abstract

Talk to us

Similar Papers

More From: Accident Analysis and Prevention