Abstract
Rationale: Given datasets with a large or diverse set of predictors of aggression, machine learning (ML) provides efficient tools for identifying the most salient variables and building a parsimonious statistical model. ML techniques permit efficient exploration of data, have not been widely used in aggression research, and may have utility for those seeking prediction of aggressive behavior.Objectives: The present study examined predictors of aggression and constructed an optimized model using ML techniques. Predictors were derived from a dataset that included demographic, psychometric and genetic predictors, specifically FK506 binding protein 5 (FKBP5) polymorphisms, which have been shown to alter response to threatening stimuli, but have not been tested as predictors of aggressive behavior in adults.Methods: The data analysis approach utilized component-wise gradient boosting and model reduction via backward elimination to: (a) select variables from an initial set of 20 to build a model of trait aggression; and then (b) reduce that model to maximize parsimony and generalizability.Results: From a dataset of N = 47 participants, component-wise gradient boosting selected 8 of 20 possible predictors to model Buss-Perry Aggression Questionnaire (BPAQ) total score, with R2 = 0.66. This model was simplified using backward elimination, retaining six predictors: smoking status, psychopathy (interpersonal manipulation and callous affect), childhood trauma (physical abuse and neglect), and the FKBP5_13 gene (rs1360780). The six-factor model approximated the initial eight-factor model at 99.4% of R2.Conclusions: Using an inductive data science approach, the gradient boosting model identified predictors consistent with previous experimental work in aggression; specifically psychopathy and trauma exposure. Additionally, allelic variants in FKBP5 were identified for the first time, but the relatively small sample size limits generality of results and calls for replication. This approach provides utility for the prediction of aggression behavior, particularly in the context of large multivariate datasets.
Highlights
Aggression is a complex multifaceted phenomenon (Anderson and Bushman, 2002; Raine, 2002; Mendes et al, 2009) that is influenced by many factors
FK506 binding protein 5 (FKBP5) allele distributions did not deviate from Hardy-Weinberg equilibrium
The present report used the mboost technique with subsequent backward elimination to determine a parsimonious set of predictors of trait aggression, highlighted by associations with callous affect, childhood trauma and FKBP5_13 alleles
Summary
Aggression is a complex multifaceted phenomenon (Anderson and Bushman, 2002; Raine, 2002; Mendes et al, 2009) that is influenced by many factors. As data science has become more established and widely utilized in scientific discovery and prediction (Hastie et al, 2009; Hofman et al, 2017; Wiens and Shenoy, 2018), novel inductive analytic techniques have enabled and advanced the analysis of complex, multivariate data. These approaches include mining of very large datasets, as well as application to smaller datasets where large amounts of information are obtained from each individual, but the dataset contains a relatively small number of subjects. We utilized a data science approach to examine predictors of trait aggression, including interpersonal and demographic variables, history of trauma, psychopathology and genetic variations in the FK506 binding protein 5 (FKBP5) protein
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.