Tuning Word2vec for Large Scale Recommendation Systems

Benjamin P Chamberlain,Michael M Bronstein,Suvash Sedhain,Emanuele Rossi,Dan Shiebler

doi:10.1145/3383313.3418486

Abstract

Word2vec is a powerful machine learning tool that emerged from Natural Lan-guage Processing (NLP) and is now applied in multiple domains, including recom-mender systems, forecasting, and network analysis. As Word2vec is often used offthe shelf, we address the question of whether the default hyperparameters are suit-able for recommender systems. The answer is emphatically no. In this paper, wefirst elucidate the importance of hyperparameter optimization and show that un-constrained optimization yields an average 221% improvement in hit rate over thedefault parameters. However, unconstrained optimization leads to hyperparametersettings that are very expensive and not feasible for large scale recommendationtasks. To this end, we demonstrate 138% average improvement in hit rate with aruntime budget-constrained hyperparameter optimization. Furthermore, to makehyperparameter optimization applicable for large scale recommendation problemswhere the target dataset is too large to search over, we investigate generalizinghyperparameters settings from samples. We show that applying constrained hy-perparameter optimization using only a 10% sample of the data still yields a 91%average improvement in hit rate over the default parameters when applied to thefull datasets. Finally, we apply hyperparameters learned using our method of con-strained optimization on a sample to the Who To Follow recommendation serviceat Twitter and are able to increase follow rates by 15%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Tuning Word2vec for Large Scale Recommendation Systems

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

EEG reveals that dextroamphetamine improves cognitive control through multiple processes in healthy participants.
Savita G Bhakta ... John A Nungaray
Neuropsychopharmacology | VOL. 47
Savita G Bhakta, et. al.Savita G Bhakta ... John A Nungaray
18 Jan 2022
Neuropsychopharmacology | VOL. 47

Physics-Informed AI-based Modelling for Flood Early Warning Systems
Farzad Piadeh ... Kourosh Behzadian
-
Farzad Piadeh, et. al.Farzad Piadeh ... Kourosh Behzadian
08 Mar 2024
08 Mar 2024

Increase of the particle hit rate in a laser single-particle mass spectrometer by pulse delayed extraction technology
Ying Chen ... Sergei Nikiforov
Atmospheric Measurement Techniques | VOL. 13
Ying Chen, et. al.Ying Chen ... Sergei Nikiforov
28 Feb 2020
Atmospheric Measurement Techniques | VOL. 13

Trade-off between Hit Rate and Hit Latency for Optimizing DRAM Cache
Pai Chen ... Xiaofei Liao
IEEE Transactions on Emerging Topics in Computing | VOL. 9
Pai Chen, et. al.Pai Chen ... Xiaofei Liao
01 Jan 2018
IEEE Transactions on Emerging Topics in Computing | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Tuning Word2vec for Large Scale Recommendation Systems

Abstract

Talk to us

Similar Papers