Stingless bees are paramount in food chain as they are important pollinators of field crops. Recent studies revealed that these bees are seriously threatened by climate change and rapid urbanization across the world. It is thus important to study the relationship between the stingless bee’s diversity and the characteristics of the locations they inhibit. At the same time, clustering algorithms is a powerful machine learning approach in exploring unsupervised data. Consequently, this study aims to explore the stingless bee diversity in Malaysia through hierarchical, k-means and DBSCAN clustering. The dataset of this study consists of individual stingless bees collected from 12 locations. It comprises 14 environmental features, 3 physical characteristics, 35 species count, 12 genera counts and 3 diversity-and-abundance weights. A four-stage methodology is employed in the study. The results show that DBSCAN effectively groups data into clusters that are well-defined, but the results are less informative. In contrast, hierarchical and k-means clustering are found producing results that provide clearer insights, with hierarchical clustering delivering notably richer results.
Read full abstract