Collaborative recommendation involves leveraging information from one entity to suggest attributes of other entities through pattern analysis. These models are invaluable for understanding data behaviour’s impact on related entities. Researchers propose various pattern recognition and correlation models, employing distance metrics like Jaccard, Cosine, etc., to estimate correlations between user queries and recommendation datasets. However, these models become less efficient as dataset size increases due to exponential correlation estimation delays. To enhance scalability while retaining recommendation quality, a new approach is introduced: ADCMDES. This novel augmented cross-domain collaborative model employs a hybrid distance metric and ensemble stratification for dataset pruning. It operates semi-supervisedly, requiring information about the entities being collaborated upon. This data is used to cluster similar collaborative entities, producing a condensed cluster with greater relevancy to user queries. Utilizing word2vec, records are transformed into features for an ensemble classification engine. The resultant model categorizes user input, directing it to the most relevant cluster. Entries within this cluster are ranked using a hybrid metric amalgamating 18 distance measures, enhancing correlation between input queries and recommendations. In testing, ADCMDES exhibited 15% better accuracy, 8% better precision, 9% better recall, and 3% lower RMSE compared to standard models across datasets. While some delay is introduced due to ensemble classification and augmented feature pooling, this doesn't considerably affect long-term recommendation performance and can be mitigated through parallel processing and redundancy reduction techniques.
Read full abstract