Abstract

Personal vehicles are invariably being preferred over public transport nowadays. Contact-less feature inspection and analysis based on personal preferences will be in high demand among customers in the post-pandemic world. A comprehensive online car recommendation system will be the customers’ spontaneous choice to understand and select the features of vehicles. However, the clustering of such categorical features is a challenging task as it is difficult to compare two textual attributes. In this paper, we have designed a cloud-based system that will automatically address this issue. Motivated by the cooperative game theory and fuzzy technique, and integrating the concept of Shapley theorem, a categorical data clustering algorithm has been developed. At the same time, to overcome the major limitation of having a high time complexity of the order O(n2) associated with the Shapley computation, the proposed algorithm has been distributed using Apache Spark’s Map Reduce architecture in Google Cloud Platform. The model has been thoroughly validated based on its performance on several synthetic as well as real data sets. Finally, a car recommendation system has been proposed and tested on three car sell data sets. The proposed approach outperforms the corresponding existing categorical clustering approaches in terms of various clustering validity indices. To the best of the authors’ knowledge, this is the first attempt to apply Map Reduce based Shapley computation over the categorical clustering, which can find its application beyond the proposed car recommendation system as well.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call