Abstract

The goal of this research paper is to propose an approach to uncovering important and previously unknown indicators that could prevent suicide by analyzing data released by the Centers for Disease Control and Prevention (CDC). The CDC data includes 75 variables that identify characteristics and demographics of the deceased. This paper describes the process of exploring data science methods to build a predictive model that could lead to prevention of suicide rate in USA. The data was explored with the Python programming language, R scripting language, RStudio and in the Tableau environment. Preliminary analysis of the CDC 2013 deaths by suicide data indicate that education was one of the important factors that lead to suicide; further analysis of this and other data may provide a new set of objective risk factors from one or more of the variables in this data set.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.