Abstract
BackgroundThe police attend numerous domestic violence events each year, recording details of these events as both structured (coded) data and unstructured free-text narratives. Abuse types (including physical, psychological, emotional, and financial) conducted by persons of interest (POIs) along with any injuries sustained by victims are typically recorded in long descriptive narratives.ObjectiveWe aimed to determine if an automated text mining method could identify abuse types and any injuries sustained by domestic violence victims in narratives contained in a large police dataset from the New South Wales Police Force.MethodsWe used a training set of 200 recorded domestic violence events to design a knowledge-driven approach based on syntactical patterns in the text and then applied this approach to a large set of police reports.ResultsTesting our approach on an evaluation set of 100 domestic violence events provided precision values of 90.2% and 85.0% for abuse type and victim injuries, respectively. In a set of 492,393 domestic violence reports, we found 71.32% (351,178) of events with mentions of the abuse type(s) and more than one-third (177,117 events; 35.97%) contained victim injuries. “Emotional/verbal abuse” (33.46%; 117,488) was the most common abuse type, followed by “punching” (86,322 events; 24.58%) and “property damage” (22.27%; 78,203 events). “Bruising” was the most common form of injury sustained (51,455 events; 29.03%), with “cut/abrasion” (28.93%; 51,284 events) and “red marks/signs” (23.71%; 42,038 events) ranking second and third, respectively.ConclusionsThe results suggest that text mining can automatically extract information from police-recorded domestic violence events that can support further public health research into domestic violence, such as examining the relationship of abuse types with victim injuries and of gender and abuse types with risk escalation for victims of domestic violence. Potential also exists for this extracted information to be linked to information on the mental health status.
Highlights
BackgroundDomestic violence is a global social and public health phenomenon with important health consequences that affect thousands of lives each year [1,2,3]
We investigate whether the application of a text mining method can automatically extract abuse types and sustained victim injuries from a large-scale corpus of 492,393 domestic violence events
The text mining system was evaluated on a set of 100 previously unseen, randomly chosen domestic violence event reports
Summary
Domestic violence is a global social and public health phenomenon with important health consequences that affect thousands of lives each year [1,2,3] It can be defined as “any incident of threatening behavior, violence (or psychological, physical, sexual, financial, emotional) abuse between adults who are or have been an intimate partner or family member, regardless of gender or sexuality” [4,5,6]. Abuse types (including physical, psychological, emotional, and financial) conducted by persons of interest (POIs) along with any injuries sustained by victims are typically recorded in long descriptive narratives. Conclusions: The results suggest that text mining can automatically extract information from police-recorded domestic violence events that can support further public health research into domestic violence, such as examining the relationship of abuse types with victim injuries and of gender and abuse types with risk escalation for victims of domestic violence. Potential exists for this extracted information to be linked to information on the mental health status
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.