Abstract

Modelling criminal trial verdict outcomes using social network measures is an emerging research area in quantitative criminology. Few studies have yet analyzed which of these measures are the most important for verdict modelling or which data classification techniques perform best for this application. To compare the performance of different techniques in classifying members of a criminal network, this article applies three different machine learning classifiers–Logistic Regression, Naïve Bayes and Random Forest–with a range of social network measures and the necessary databases to model the verdicts in two real–world cases: the U.S. Watergate Conspiracy of the 1970’s and the now–defunct Canada–based international drug trafficking ring known as the Caviar Network. In both cases it was found that the Random Forest classifier did better than either Logistic Regression or Naïve Bayes, and its superior performance was statistically significant. This being so, Random Forest was used not only for classification but also to assess the importance of the measures. For the Watergate case, the most important one proved to be betweenness centrality while for the Caviar Network, it was the effective size of the network. These results are significant because they show that an approach combining machine learning with social network analysis not only can generate accurate classification models but also helps quantify the importance social network variables in modelling verdict outcomes. We conclude our analysis with a discussion and some suggestions for future work in verdict modelling using social network measures.

Highlights

  • Modelling criminal trial verdict outcomes is a classic problem in predictive criminology [1], building verdict classification models for criminal networks is a relatively new area of research

  • Random Forest (RF) was the classifier with the highest average scores and the lowest standard deviations for the accuracy, precision, recall, Matthews Correlation Coefficient (MCC) measures, and ROC Area (AUC)

  • Apparent is that Logistic Regression (LR) outperformed Naïve Bayes (NB) on all of the measures except Recall, where NB did better

Read more

Summary

Introduction

Modelling criminal trial verdict outcomes is a classic problem in predictive criminology [1], building verdict classification models for criminal networks is a relatively new area of research. The scientific investigation of social networks in criminal organizations is a branch of quantitative criminology that generates knowledge regarding such networks through the analysis of links between network members [2] Such an analysis requires data that, unlike the information normally employed by criminologists, bears directly on these membership ties. By examining these data, the researcher can explore in detail the social behaviour of criminal groups and organizations [3,4,5,6,7,8,9,10,11,12] and terrorists operations [13,14,15]. This focus on the ties or links between group members is what accounts for the success of social network analysis in the study of criminal organizations [16,17,18,19,20,21,22,23,24,25]

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call