The identifying hidden data features problem solution

S Y Petrova,M A Boikova

doi:10.1088/1742-6596/1352/1/012039

Abstract

In the article, we considered recommender models based on matrix factorization demonstrate excellent performance in collaborative filtering. The standard Matrix Factorization approach in MLlib deals with clear ratings. To work with implicit data, we used the trainImplicit method. To simulate the processing of real-time data streams, we used the Spark Streaming library, which is responsible for receiving data from the input source and converting the raw data into a discretized stream discretized stream (DStream) consisting of Spark RDD. The rank parameter determines the number of hidden features in the low rank approximation matrices. As a rule, the greater the number of factors, the better, but for a large number of users or elements, it will directly affect the memory usage of the computing system and the amount of data required for training. Therefore, in our problem it was a compromise solution.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

The identifying hidden data features problem solution

Abstract

Talk to us

Similar Papers

More From: Journal of Physics: Conference Series

Lead the way for us

Journal: Journal of Physics: Conference Series	Publication Date: Oct 1, 2019
License type: cc-by

Similar Papers

Efficiency of Stream Processing Engines for Processing BIGDATA Streams
B V S Srikanth ... V Krishna Reddy
Indian Journal of Science and Technology | VOL. 9
B V S Srikanth, et. al.B V S Srikanth ... V Krishna Reddy
29 Apr 2016
Indian Journal of Science and Technology | VOL. 9

Multi-Source Information Fusion Technology and Its Application in Smart Distribution Power System
Xi He ... Wanli Yang
Sustainability | VOL. 15
Xi He, et. al.Xi He ... Wanli Yang
03 Apr 2023
Sustainability | VOL. 15

Adaptive Real Time IoT Stream Processing in Microservices Architecture
...
-
, et. al. ...
01 Jan 2020
01 Jan 2020

A hybrid approach combining real-time and archived data for mobility analysis
Loic Salmon ... Cyril Ray
-
Loic Salmon, et. al.Loic Salmon ... Cyril Ray
03 Nov 2015
03 Nov 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The identifying hidden data features problem solution

Abstract

Talk to us

Similar Papers

More From: Journal of Physics: Conference Series