Abstract
The recent advancements in the field of data mining have made vast progress in extracting new information and hidden patterns from large datasets which are often overlooked by the traditional statistical approaches. These methods focus on searching for new and interesting hypothesis which were previously unobserved. Road safety researchers working with the crash data from developed world have seen encouraging success in obtaining new insight into crash mechanism through data mining. An attempt was made in this study to apply these advance methods and evaluate their performance in manifesting crash causes for Bangladesh. The study applies hierarchical clustering to identify hazardous clusters, random forest to find important variables explaining each of these clusters, and classification and regression trees to unveil their respective crash mechanisms for the road crash data of Bangladesh. The results identified several new interesting relationships and acknowledged issues related to quality of data.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have