In the current digital era, e-commerce has become one of the main pillars of global trade. With the ever-increasing amount of transaction and user activity data, e-commerce companies are faced with the challenge of understanding and managing diverse customer segments more effectively. This paper discusses the application of clustering algorithms for e-commerce customer segmentation based on purchasing data and user activity. The aim of this research is to identify homogeneous customer groups to support more targeted marketing strategies and increase customer retention. The problem faced is how to process big data originating from user transactions and activities on e-commerce platforms, as well as how to identify patterns that are useful for customer segmentation. The data used in this research includes purchase history, frequency of visits, length of time spent on the site, and interactions with certain products. The solution method applied in this research is the clustering algorithm, especially K-Means and DBSCAN. K-Means is used to group data into a predetermined number of clusters based on the Euclidean distance between data points. Meanwhile, DBSCAN is used to identify clusters with high density and separate data that is considered noise or outliers. Data preprocessing is carried out to clean and normalize the data before being applied to the clustering algorithm. Validation of clustering results is carried out using metrics such as Silhouette Score and Davies-Bouldin Index. The research results show that by applying the clustering algorithm, customers can be grouped into several segments that have similar characteristics. For example, we found groups of customers with high purchase frequency but low transaction value, as well as other groups with high transaction value but low purchase frequency. This information is very useful for companies to design more effective marketing strategies, such as special offers for customers with high transaction values or loyalty programs for customers with high purchasing frequency. The conclusion of this research is that clustering algorithms can be a very effective tool in e-commerce customer segmentation, allowing companies to understand customer behavior patterns and develop more targeted and effective marketing strategies. Thus, implementing this method is expected to improve business performance and overall customer satisfaction.
Read full abstract