Abstract
Matrix factorization (MF), one of the most accurate and scalable dimension reduction techniques, has become popular in the collaborative filtering (CF) recommender systems community. Non-negative Matrix Factorization (NMF) is currently among the most widely used MF approaches, because its non-negativity constraint suits the CF model. However, it is non-trivial to obtain high performance from sparse NMF (SNMF) on Graphics Processing Units (GPUs) for large-scale problems, due to redundant large-scale intermediate data, frequent matrix manipulation, and access to a sparse rating matrix whose non-zero entries are irregularly distributed. In this work, we propose single-thread-based SNMF, which depends only on multiplication and summation of the involved feature tuples, and we then present L2-norm-regularized single-thread-based SNMF. On that basis, a novel CUDA parallelization approach for NMF (CuSNMF) is presented for GPU computing. Furthermore, to process large-scale CF data sets and take advantage of GPU computation power, we propose multi-GPU CuSNMF (MCuSNMF). Compared with the state-of-the-art parallel algorithms CCD++ and CUMF, MCuSNMF obtains the highest performance.
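To make the idea of an update built only from feature-tuple multiplications and summations over the non-zero ratings more concrete, the following is a minimal sequential sketch in Python. It is not the authors' CuSNMF kernel: it assumes the standard multiplicative NMF update restricted to observed entries, with an L2 penalty added to the denominator, and the names `snmf_epoch`, `_update`, `rmse`, `lam`, and `eps` are hypothetical and chosen for illustration only.

```python
import random


def _update(triples, A, B, lam, eps):
    """Multiplicatively update the rows of A while holding B fixed.

    triples: iterable of (a_index, b_index, rating), observed entries only.
    Assumption: standard multiplicative rule with an L2 term in the denominator.
    """
    k = len(A[0])
    num = [[0.0] * k for _ in A]
    den = [[0.0] * k for _ in A]
    for a, b, r in triples:
        # Predicted rating is the dot product of the two feature tuples.
        pred = sum(A[a][f] * B[b][f] for f in range(k))
        for f in range(k):
            num[a][f] += r * B[b][f]
            den[a][f] += pred * B[b][f]
    for a in range(len(A)):
        for f in range(k):
            if den[a][f] > 0.0:  # leave rows without observations untouched
                A[a][f] *= num[a][f] / (den[a][f] + lam * A[a][f] + eps)


def snmf_epoch(ratings, U, V, lam=0.05, eps=1e-12):
    """One sweep: update user factors U, then item factors V, in place."""
    _update(ratings, U, V, lam, eps)
    _update([(j, i, r) for i, j, r in ratings], V, U, lam, eps)


def rmse(ratings, U, V):
    """Root-mean-square error over the observed entries only."""
    k = len(U[0])
    se = sum((r - sum(U[i][f] * V[j][f] for f in range(k))) ** 2
             for i, j, r in ratings)
    return (se / len(ratings)) ** 0.5


if __name__ == "__main__":
    random.seed(0)
    # Toy sparse rating matrix as (user, item, rating) triples.
    ratings = [(0, 0, 5.0), (0, 1, 3.0), (1, 0, 4.0),
               (1, 2, 1.0), (2, 1, 2.0), (2, 2, 5.0)]
    U = [[random.uniform(0.1, 1.0) for _ in range(2)] for _ in range(3)]
    V = [[random.uniform(0.1, 1.0) for _ in range(2)] for _ in range(3)]
    print("RMSE before:", rmse(ratings, U, V))
    for _ in range(50):
        snmf_epoch(ratings, U, V)
    print("RMSE after: ", rmse(ratings, U, V))
```

Because every factor is only ever multiplied by a non-negative ratio, non-negativity is preserved without an explicit projection step, and no dense intermediate matrix is formed. In a GPU setting, the per-row accumulations in `_update` are the natural unit of work to assign to threads, though how CuSNMF actually partitions that work is not specified in this abstract.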