Abstract
Effective Clustering of mental health data can provide significant insights into patterns and relationships that are critical for understanding mental health conditions. This study investigated various clustering techniques applied to balanced mental health data to avoid biases associated with an imbalanced data. Clustering of the balanced mental health data was done with respect to the area of Residence feature. Firstly, Random undersampling and SMOTE techniques were incorporated to the imbalanced data set as balancing techniques so as to improve model performance. Random Undersampling Technique turned out to be the most ideal balancing technique with its accuracy, recall, precision and F-score values as 1. After balancing the data, two clustering techniques were applied to the Random Undersampled balanced data. The two techniques were namely: K-means and Divisive techniques. In order to select which of the two clustering techniques is ideal, two test statistics namely Internal Validation and Stability Validation were applied. Results showed that K-means clustering technique indicated slightly lower Average Propotion of None-overlap, Average Distance between Means and Figure Of Merit values given as 0.12, 0.41 and 0.9972 as compared to Divisive clustering technique which were 0.14, 0.42 and 0.9999. The conclusion was that K-means clustering has a better performance. This study's findings will help guide future researchers dealing with mental health data analysis on ways to improve model performance for better and more reliable predictions.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.