Abstract

Recommender systems provide an important tool for users to find interested items from the massive amount of user-generated contents. As user interests often change over time and contents become available in a streaming fashion, it is highly desirable to support real-time recommendation that can adapt to changes in user interests and contents. If we represent both user interests and items by high-dimensional points in the same vector space, we can recommend to the user the $k$ items that are the nearest neighbors (kNN) of the user. The problem of real-time recommendation, thus, translates to computing the kNNs based on the most recent items when the user interests change. As such, the main issue we tackle in this paper is to efficiently process high-dimensional kNN queries over a sliding window on data streams. In particular, we are interested in developing a scalable distributed solution to be able to handle the ever-increasing number of users and volume of data. We propose a new index structure called the dynamic bounded rings index (DBRI) to index the data points in data streams. The basic idea is to first find a set of pivots and assign all points to their nearest pivot to form subsets and then partition each subset into finer-grained bounded rings that can be dynamically adjusted as points change. The design of DBRI lends itself to easy adoption in a distributed setting. We further present the distributed high-dimensional kNN query algorithm (DHDKNN) based on DBRI, aiming at reducing both the communication and the computational cost of query processing. The experiments demonstrate that our algorithm scales well and significantly outperforms the existing methods.

Highlights

  • As user-generated contents are experiencing an explosive rate of growth, recommender systems offer users a great way to access information of their particular interest out of the vast volume of data

  • To provide effective real-time recommendation, we focus on a popular approach to recommender systems – content-based filtering, which is based on a characterizing description/representation of the item and a profile of the user interests [1], [2]

  • 1) We propose a framework of content-based real-time recommendation based on k nearest neighbors (kNN) query processing on data streams

Read more

Summary

Introduction

As user-generated contents are experiencing an explosive rate of growth, recommender systems offer users a great way to access information of their particular interest out of the vast volume of data. For social applications where user interests change rapidly over time and contents become available as continuous streams, traditional recommender systems no longer suffice, because most traditional recommender systems assume that the user interests are stable and operate in an offline fashion. Of updating the recommendations online accounting for the most recent changes in user interests and data items. To provide effective real-time recommendation, we focus on a popular approach to recommender systems – content-based filtering, which is based on a characterizing description/representation of the item and a profile of the user interests [1], [2]. With a given distance metric, we recommend to the user those items whose

Objectives
Methods
Findings
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call