Average Silhouette Coefficient Research Articles

Nature-inspired algorithms are based on the concepts of self-organization and complex biological systems. Researchers design them by observing how naturally occurring phenomena behave, and apply them to complex problems in a variety of environments. Their introduction has led to new branches of study such as neural networks, swarm intelligence, evolutionary computation, and artificial immune systems. Particle swarm optimization (PSO), social spider optimization (SSO), and other nature-inspired algorithms have had some success in solving clustering problems, but they may converge to local optima because exploration and exploitation are poorly balanced. In this paper, we propose a novel implementation of SSO, social spider optimization for data clustering using single centroid representation and enhanced mating operation (SSODCSC), to improve that balance. In SSODCSC, each spider is implemented as a centroid together with the data instances close to it. Non-dominant male spiders are allowed to mate with female spiders by first converting those males into dominant males. SSODCSC produces better values for the sum of intra-cluster distances, the average CPU time per iteration (in seconds), accuracy, the F-measure, and the average silhouette coefficient than K-means and other nature-inspired techniques. Compared with other nature-inspired algorithms on the Patent corpus datasets, the overall increase in accuracy is approximately 13%. Compared with other nature-inspired algorithms on the UCI datasets, the overall increase in the F-measure is approximately 10%. For completeness, the best K cluster centroids (the best K spiders) returned by SSODCSC are reported. To show the significance of the proposed algorithm, we conducted a one-way ANOVA test on the accuracy and F-measure values returned by the clustering algorithms.
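
Two of the measures named in the abstract above can be illustrated briefly. The sketch below is not taken from the SSODCSC paper: it assumes Euclidean distance, takes the sum of intra-cluster distances to mean the distance from each instance to its cluster mean, and uses the standard silhouette definition s(i) = (b(i) - a(i)) / max(a(i), b(i)), where a(i) is the mean distance from instance i to the other members of its cluster and b(i) is the smallest mean distance from i to the members of any other cluster. The function names are hypothetical.

```python
import math


def sum_intra_cluster_distances(points, labels):
    """Sum of Euclidean distances from each point to its cluster mean (assumed definition)."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    total = 0.0
    for members in clusters.values():
        centroid = [sum(col) / len(members) for col in zip(*members)]
        total += sum(math.dist(p, centroid) for p in members)
    return total


def average_silhouette(points, labels):
    """Mean of s(i) over all points; singleton clusters contribute s(i) = 0."""
    clusters = {}
    for i, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(i)
    if len(clusters) < 2:
        raise ValueError("the silhouette needs at least two clusters")
    total = 0.0
    for i, lab in enumerate(labels):
        own = clusters[lab]
        if len(own) == 1:
            continue  # s(i) = 0 by convention for a singleton cluster
        # a: mean distance to the other members of the point's own cluster
        a = sum(math.dist(points[i], points[j]) for j in own if j != i) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster
        b = min(
            sum(math.dist(points[i], points[j]) for j in members) / len(members)
            for other, members in clusters.items()
            if other != lab
        )
        if max(a, b) > 0:  # guard against coincident points
            total += (b - a) / max(a, b)
    return total / len(points)


if __name__ == "__main__":
    # Toy data (illustrative only): two tight, well-separated pairs.
    pts = [(0, 0), (0, 1), (10, 0), (10, 1)]
    print(average_silhouette(pts, [0, 0, 1, 1]))
    print(sum_intra_cluster_distances(pts, [0, 0, 1, 1]))
```

Values of the average silhouette close to 1 indicate compact, well-separated clusters, while negative values suggest that many instances sit closer to another cluster than to their own.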

Sports-related injuries can have a significant impact on an athlete’s performance and career. While some injuries are inevitable, many can be prevented. Cluster analysis is a useful statistical technique that can assign individuals to groups (i.e., latent subgroups) based on common characteristics. PURPOSE: To use cluster analysis to 1) identify latent subgroups based on athletes’ injury history; and 2) examine the characteristics of those subgroups. METHODS: A total of 1,538 college athletes competing in the Southeastern Conference of NCAA Division I were segmented by three criteria: 1) injury part, the body part sustaining the injury; 2) injury type, the nature of the injury, such as strain, contusion, or tendonitis; and 3) injury duration, how long the athlete was unable to participate in training. K-means clustering with the Euclidean similarity of injury log vectors was used to label players. The number of groups (k) was determined by the average silhouette method. The characteristics of the clusters were analyzed descriptively, and the sports were then allocated to each group following the athlete clusters. RESULTS: Five clusters were identified by the maximum average silhouette coefficient (0.153) among coefficients for randomly drawn values of k between 2 and 20. The first group, mainly baseball, men’s basketball, and men’s tennis athletes, had contusions and strains of the ankle, arm, and hamstring lasting a few weeks. The second group was mostly from football, with injuries to the ankle, knee, and shoulder and the longest injury durations. The third group, mostly football or track and field athletes, was likely to have knee inflammation lasting nearly half a year. The injured body parts of the fourth group were the back, finger, and hamstring, and the injury types were fracture and tendonitis; this cluster was mainly women’s basketball and track and field athletes. The members of the last group had head injuries (e.g., concussion) and were soccer, softball, or volleyball athletes. CONCLUSION: This study may help practitioners recognize the likelihood of an athlete’s injury according to their sport. Coaches could also consider this information in daily practice.
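
The average silhouette method mentioned in the METHODS above can be sketched as follows. This is an illustration under stated assumptions, not the study's code: it runs a plain Lloyd-style K-means with Euclidean distance and a deterministic farthest-first initialisation (chosen here only to keep the example reproducible), scans a small contiguous range of k rather than randomly drawn values, and uses synthetic two-dimensional points in place of injury log vectors. All function names are hypothetical.

```python
import math
import random


def kmeans(points, k, iters=100):
    """Lloyd's algorithm with farthest-first initialisation (an assumption of this sketch)."""
    centroids = [points[0]]
    while len(centroids) < k:
        # Next seed: the point farthest from its nearest existing centroid.
        centroids.append(max(points, key=lambda p: min(math.dist(p, c) for c in centroids)))
    labels = None
    for _ in range(iters):
        new_labels = [min(range(k), key=lambda c: math.dist(p, centroids[c])) for p in points]
        if new_labels == labels:
            break  # assignments are stable, so the algorithm has converged
        labels = new_labels
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster becomes empty
                centroids[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return labels


def average_silhouette(points, labels):
    """Mean silhouette over all points; returns -1.0 if fewer than two clusters are used."""
    clusters = {}
    for i, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(i)
    if len(clusters) < 2:
        return -1.0
    total = 0.0
    for i, lab in enumerate(labels):
        own = clusters[lab]
        if len(own) == 1:
            continue  # singleton clusters contribute 0
        a = sum(math.dist(points[i], points[j]) for j in own if j != i) / (len(own) - 1)
        b = min(
            sum(math.dist(points[i], points[j]) for j in members) / len(members)
            for other, members in clusters.items()
            if other != lab
        )
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(points)


def choose_k(points, k_values):
    """Return the k with the highest average silhouette, plus all scores."""
    scores = {k: average_silhouette(points, kmeans(points, k)) for k in k_values}
    best_k = max(scores, key=scores.get)
    return best_k, scores


# Synthetic stand-in data: three well-separated blobs of 15 points each.
rng = random.Random(42)
blob_centres = [(0, 0), (10, 0), (5, 9)]
points = [
    (cx + rng.uniform(-1, 1), cy + rng.uniform(-1, 1))
    for cx, cy in blob_centres
    for _ in range(15)
]
best_k, scores = choose_k(points, range(2, 7))
print(best_k, scores)
```

On cleanly separated synthetic blobs the best average silhouette is high; the much lower maximum reported in the abstract (0.153) suggests that the injury-history clusters overlap considerably, so the resulting groups are best read as descriptive.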

Articles published on Average Silhouette Coefficient

A Novel Ensemble Framework Based on K-Means and Resampling for Imbalanced Data

A novel variant of social spider optimization using single centroid representation and enhanced mating for data clustering.

Classification of Collegiate Athletes Based on Their Injury History

Blind Estimation PN Sequence in Soft Spread Spectrum Signal of Improved K-means Algorithm

Application of k-means clustering to environmental risk zoning of the chemical industrial area

A Fast k-Means Implementation Using Coresets
