Efficient preference clustering via random fourier features

Jingshu Liu,Jinglei Liu,Li Wang

doi:10.26599/bdma.2019.9020003

Jingshu Liu, Jinglei Liu + Show 1 more

Open Access

https://doi.org/10.26599/bdma.2019.9020003

Copy DOI

Abstract

Approximations based on random Fourier features have recently emerged as an efficient and elegant method for designing large-scale machine learning tasks. Unlike approaches using the Nystrom method, which randomly samples the training examples, we make use of random Fourier features, whose basis functions (i.e., cosine and sine ) are sampled from a distribution independent from the training sample set, to cluster preference data which appears extensively in recommender systems. Firstly, we propose a two-stage preference clustering framework. In this framework, we make use of random Fourier features to map the preference matrix into the feature matrix, soon afterwards, utilize the traditional k-means approach to cluster preference data in the transformed feature space. Compared with traditional preference clustering, our method solves the problem of insufficient memory and greatly improves the efficiency of the operation. Experiments on movie data sets containing 100 000 ratings, show that the proposed method is more effective in clustering accuracy than the Nystrom and k-means, while also achieving better performance than these clustering approaches.

Highlights

With the rapid development of information technology, data storage has been made relatively inexpensive and abundant, resulting in extremely large data sets
Compared with the traditional preference clustering approach, this paper makes the following contributions: (1) We present a two-stage framework for clustering preference data
We can see that for the same data set, our PCRFF approach is superior to the k-means and Nystrom approaches

Summary

Introduction

With the rapid development of information technology, data storage has been made relatively inexpensive and abundant, resulting in extremely large data sets. Data mining provides us with an effective way to explore and analyze hidden patterns behind these data. These data sets share one prominent feature: which is enormity in size with tens of thousands of objects and features. Data sets are often sparse, so, how to excavate hidden patterns is a important problem. Clustering is an effective method which groups a set of objects in such a way that objects in the same group are more similar to each other than to those in other groups[1].

Objectives

Methods

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Big Data Mining and Analytics	Publication Date: Sep 1, 2019
Citations: 3	License type: cc-by

R Discovery Prime

R Discovery Prime

Efficient preference clustering via random fourier features

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Big Data Mining and Analytics

Lead the way for us

Similar Papers

On the Sample Complexity of Random Fourier Features for Online Learning
Ming Lin ... Changshui Zhang
ACM Transactions on Knowledge Discovery from Data | VOL. 8
Ming Lin, et. al.Ming Lin ... Changshui Zhang
01 Jun 2014
ACM Transactions on Knowledge Discovery from Data | VOL. 8

Polarity Identification of Aspect based Sentiment Reviews
...
Indian journal of science and technology | VOL. 9
, et. al. ...
28 Dec 2016
Indian journal of science and technology | VOL. 9

Privacy by Projection: Federated Population Density Estimation by Projecting on Random Features.
Zixiao Zong ... Athina Markopoulou
Proceedings on Privacy Enhancing Technologies. Privacy Enhancing Technologies Symposium | VOL. 2023
Zixiao Zong, et. al.Zixiao Zong ... Athina Markopoulou
01 Jan 2023
Proceedings on Privacy Enhancing Technologies. Privacy Enhancing Technologies Symposium | VOL. 2023

Iteratively reweighted least square for kernel expectile regression with random features
Yue Cui ... Songfeng Zheng
Journal of Statistical Computation and Simulation | VOL. 93
Yue Cui, et. al.Yue Cui ... Songfeng Zheng
08 Mar 2023
Journal of Statistical Computation and Simulation | VOL. 93

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient preference clustering via random fourier features

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Big Data Mining and Analytics