Correlation range query for effective recommendations

Wenjun Zhou,Hao Zhang

doi:10.1007/s11280-013-0265-x

Abstract

Efficient correlation computation has been an active research area of data mining. Given a large dataset and a specified query item, we are interested in finding items in the dataset that are within certain range of correlation with the query item. Such a problem, known as the correlation range query, has been a common task in many application domains. In this paper, we identify piecewise monotone properties of the upper and lower bounds of the ? coefficient, and propose an efficient correlation range query algorithm, called CORAQ. The CORAQ algorithm effectively prunes many items without computing their actual correlation coefficients with the query item. CORAQ also attains completeness and correctness of the query results. Experiments with large benchmark datasets show that this algorithm is much faster than its brute-force alternative and scales well with large datasets. As case studies, real-world datasets from recommendation applications are analyzed to demonstrate that CORAQ can help effectively identify interesting items to recommend to users.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Correlation range query for effective recommendations

Abstract

Talk to us

Similar Papers

More From: World Wide Web

Lead the way for us

Journal: World Wide Web	Publication Date: Nov 24, 2013
Citations: 4

Similar Papers

Correlation Range Query
Wenjun Zhou ... Hao Zhang
-
Wenjun Zhou, et. al.Wenjun Zhou ... Hao Zhang
01 Jan 2013
01 Jan 2013

Minimizing Data Waste: Conservation in the Big Data Era
Allison D Binley ... Brandon P M Edwards
The Bulletin of the Ecological Society of America | VOL. 104
Allison D Binley, et. al.Allison D Binley ... Brandon P M Edwards
10 Mar 2023
The Bulletin of the Ecological Society of America | VOL. 104

A complementary graphical method for reducing and analyzing large data sets. Case studies demonstrating thresholds setting and selection.
J J Cimino ... X Jing
Methods of information in medicine | VOL. 53
J J Cimino, et. al.J J Cimino ... X Jing
01 Jan 2014
Methods of information in medicine | VOL. 53

An Accurate MDS-Based Algorithm for the Visualization of Large Multidimensional Datasets
Antoine Naud
-
Antoine NaudAntoine Naud
01 Jan 2006
01 Jan 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Correlation range query for effective recommendations

Abstract

Talk to us

Similar Papers

More From: World Wide Web