This research aims to assist MSMEs in overcoming problems in online sales. Currently, sellers only prepare stock without knowing how well the products are sold in their market segment. In the city of Tangerang alone, there are 222,602 MSMEs with various product categories. Therefore, besides utilizing offline sales, business actors should also engage in online sales. This research conducts feature selection using the Chi-Square method and Ensemble Tree Classifier to select the top 6 and 10 features. The SVM, KNN, and C4.5 algorithms are used to build prediction models based on the selected features. Using feature selection, it was found that the influential features are Estimated Shipping Cost, Shipping Cost Paid by Buyer, Total Product Price, and Estimated Shipping Cost Discount. The evaluation results using the three algorithms, SVM, KNN, and C4.5, indicate that the highest accuracy value is obtained when using the C4.5 model with data from the ensemble tree classifier with 6 features at 0.86%, followed by the C4.5 model with 10 features, KNN with 6 features, and KNN with 10 features, all of which source data from the ensemble tree classifier with an accuracy value of 0.85%.
Read full abstract