Abstract

In an online shop scenario, learning high-quality product embedding that captures various aspects of the product is important to improve the accuracy of user rating prediction. There is a lot of research about product embedding learning, for example, the side information which is the fusion of user feedback and the appearance of a product. However, because of the diversity of a product’s aspects, taking into account only its appearance as side information is not sufficient to accurately learn its embedding. In this paper, we present a matrix co-factorization method that employs information hidden in the so-called “also-viewed” or “also-bought” products, i.e., a list of products that have also been viewed or have also been bought by a user who has viewed a target product. To improve the accuracy of the rating prediction, our first step is to find out similar users. However, suppose the dataset is very large, e.g., if we have to deal with tens of millions of users’ data, the similarity calculation among users will be very time-consuming. For dealing with this problem, we use a compact binary sketch (i.e. ES, Even Sketch for user similarity estimation) to estimate user similarity. Our experiments demonstrate the superiority of our method in comparison with a state-of-the-art baseline in generating high-accuracy rating prediction.

Highlights

  • Triggered by the Netflix Prize [1] in 2009, which focused on predicting user ratings on movies based on past user feedback, many studies have been addressed on accurately predicting user ratings [2]

  • In this paper, aiming at learn high-quality product embedding, we talk about the influence of product aspects for users in different product domains

  • As a result, when we shop online, we focus the attention on different aspects of products in different product domains

Read more

Summary

INTRODUCTION

Triggered by the Netflix Prize [1] in 2009, which focused on predicting user ratings on movies based on past user feedback, many studies have been addressed on accurately predicting user ratings [2]. [10] presented a matrix co-factorization method named Visual Matrix Co-Factorization (VMCF) to improve the user rating prediction accuracy, where the ‘‘-viewed’’ products were considered but other attributes were overlooked, such as ‘‘-bought’’ and ‘‘bought-together’’. Suppose the dataset is very large, e.g., if we have to deal with tens of millions of users’ records, the calculation of similarities among users will be very time-consuming (1) To the best of our knowledge, few existing works considered information hidden in the so-called ‘‘-viewed’’ or ‘‘-bought’’ products in user rating prediction scenario.

RELATED WORKS
MODELING ALSO-VIEWED RELATIONSHIPS
MODELING ALSO-BOUGHT RELATIONSHIPS
OBJECTIVE FUNCTION
COMPLEXITY ANALYSIS
EXPERIMENTAL RESULTS
Findings
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call