Abstract
Clustering refers to algorithms to uncover such clusters in unlabeled data. Data points belonging to the same cluster exhibit similar features, whereas data points from different clusters are dissimilar to each other. The identification of such clusters leads to segmentation of data points into a number of distinct groups. In this study it was aimed to classify the 492 Holstein Friesian dairy cattle with determining the optimum number of clusters using the genomic breeding values (GBVs) calculated with 13250 SNPs using GBLUP for milk yield (kg), milk fat (%), milk protein (%), milk lactose (%), and milk dry matter (%). Results showed that the optimum number cluster was determined as two for the genomic breeding values. Determining the most appropriate number of clusters, it provides great convenience in the selection of breeding animals after determining the animals that can provide optimum efficiency in the herd or the animals that need to be eliminated from the existing herd. As a result, it can be said that the k-means method can be used successfully in clustering animals for genomic breeding values, but for this, at first, the optimum number of clusters must be determined.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have