Abstract

Understanding criminal activities, their structure and dynamics are fundamental for designing tools for crime prediction that can also guide crime prevention. Here, we study crimes committed in city community areas based on police crime reports and demographic data for the City of Chicago collected over 16 consecutive years. Our goal is to understand how the network of city community areas shapes dynamics of criminal offenses and demographic characteristics of their inhabitants. Our results reveal the presence of criminal hot-spots and expose the dynamic nature of criminal activities. We identify the most influential features for forecasting the per capita crime rate in each community. Our results indicate that city community crime is driven by spatio-temporal dynamics since the number of crimes committed in the past among the spatial neighbors of each community area and in the community itself are the most important features in our predictive models. Moreover, certain urban characteristics appear to act as triggers for the spatial spreading of criminal activities. Using the k-Means clustering algorithm, we obtained three clearly separated clusters of community areas, each with different levels of crimes and unique demographic characteristics of the district’s inhabitants. Further, we demonstrate that crime predictive models incorporating both demographic characteristics of a community and its crime rate perform better than models relying only on one type of features. We develop predictive algorithms to forecast the number of future crimes in city community areas over the periods of one-month and one-year using varying sets of features. For one-month predictions using just the number of prior incidents as a feature, the critical length of historical data, τc, of 12 months arises. Using more than τc months ensures high accuracy of prediction, while using fewer months negatively impacts prediction quality. Using features based on demographic characteristics of the district’s inhabitants weakens this impact somewhat. We also forecast the number of crimes in each community area in the given year. Then, we study in which community area and over what period an increase in crime reduction funding in this area will yield the largest reduction of the crime in the entire city. Finally, we study and compare the performance of various supervised machine learning algorithms classifying reported crime incidents into the correct crime category. Using the temporal patterns of various crime categories improves the classification accuracy. The methodologies introduced here are general and can be applied to other cities for which data about criminal activities and demographics are available.

Highlights

  • Criminal activities have been extensively studied for decades by sociologists, criminologists and law enforcement agents in a continuous effort to reduce crime in cities (Bettencourt 2015; Griffiths and Chavez 2004; Guarino-Ghezzi and Trevino 2010)

  • We studied the following four supervised machine learning algorithms: k-nearest neighbors, decision tree (DT), Naive Bayes (NB) and Support Vector Machine (SVM) and analyzed their performance with respect to various feature selections and crime categories

  • By including information about the time of day and the day of week when the crime occurred, we increased the accuracy of correctly labeling crime incidents from 20% up to 50%

Read more

Summary

Introduction

Criminal activities have been extensively studied for decades by sociologists, criminologists and law enforcement agents in a continuous effort to reduce crime in cities (Bettencourt 2015; Griffiths and Chavez 2004; Guarino-Ghezzi and Trevino 2010). The advancement of computational tools and the increasing availability of real data allow researchers to improve the study, understanding and modeling of criminal activities. Several cities across the U.S have made their crime records publicly available. Such data release benefits scientists by providing them with the access to data that can lead to effective crime modeling and forecasting, to help law enforcement agencies to better understand criminal activities, and to optimally allocate additional funding as preventive measures aiming to reduce crime

Objectives
Findings
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call