Retrieving information from huge volumes of social web data is a challenging task for conventional search engines. Information-filtering recommender systems can help users find movies, but their usefulness is limited because they do not consider movie quality in detail. Movie prediction can be improved by exploiting characteristics of the social web content about a movie, such as social quality, tag quality, and temporal aspects. In this paper, we propose using several social-quality, user-reputation, and temporal features to predict popular or highly rated movies. Moreover, an enhanced optimization-based voting classifier is proposed to improve performance on the proposed features. A standard voting classifier uses the knowledge of all the candidate classifiers but ignores how well each model performs on different classes. In the proposed model, a weight is assigned to each model based on its performance for each class. A Genetic Algorithm is used to select the optimal weights for the candidate classifiers, and the resulting model is called the Genetic Algorithm Voting (GA-V) classifier. After the suggested features are labeled using a fixed threshold, several classifiers, including Bayesian logistic regression, Naïve Bayes, BayesNet, Random Forest, SVM, Decision Tree, LSTM, and AdaBoostM1, are trained on the MovieLens dataset to find high-quality/popular movies in different categories. All the traditional ML models are compared with GA-V in terms of precision, recall, and F1 score. The results show the significance of the proposed features and of the proposed GA-V classifier.
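The idea of class-wise weighted voting with weights chosen by a genetic algorithm can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`weighted_vote`, `accuracy`, `ga_optimize_weights`), the use of accuracy as the fitness function, and the specific GA operators (elitist selection, uniform crossover, Gaussian mutation) are all assumptions made for illustration. The sketch takes the class probabilities already produced by each candidate classifier and searches for one weight per (model, class) pair.

```python
import numpy as np


def weighted_vote(probas, weights):
    """Combine candidate classifiers with one weight per (model, class).

    probas:  array of shape (n_models, n_samples, n_classes)
    weights: array of shape (n_models, n_classes)
    Returns the predicted class index for each sample.
    """
    scores = (probas * weights[:, None, :]).sum(axis=0)
    return scores.argmax(axis=1)


def accuracy(probas, weights, y):
    """Fitness used in this sketch (an assumption): plain accuracy."""
    return float((weighted_vote(probas, weights) == y).mean())


def ga_optimize_weights(probas, y, pop_size=30, generations=40,
                        mutation_scale=0.1, seed=0):
    """Search for class-wise voting weights with a simple genetic algorithm."""
    rng = np.random.default_rng(seed)
    n_models, _, n_classes = probas.shape

    # Each individual is a full (n_models, n_classes) weight matrix.
    pop = rng.random((pop_size, n_models, n_classes))
    pop[0] = 1.0  # include uniform weights (plain soft voting) as a baseline

    for _ in range(generations):
        fitness = np.array([accuracy(probas, w, y) for w in pop])
        order = np.argsort(fitness)[::-1]
        elite = pop[order[: pop_size // 2]]  # elitism: keep the best half

        # Uniform crossover between randomly paired elite parents.
        n_children = pop_size - len(elite)
        parents_a = elite[rng.integers(len(elite), size=n_children)]
        parents_b = elite[rng.integers(len(elite), size=n_children)]
        mask = rng.random(parents_a.shape) < 0.5
        children = np.where(mask, parents_a, parents_b)

        # Gaussian mutation, clipped so weights stay in [0, 1].
        children = children + rng.normal(0.0, mutation_scale, children.shape)
        children = np.clip(children, 0.0, 1.0)

        pop = np.concatenate([elite, children])

    fitness = np.array([accuracy(probas, w, y) for w in pop])
    return pop[fitness.argmax()]
```

Because the best individuals are carried over unchanged and the initial population contains the uniform weight matrix, the returned weights score at least as well as unweighted soft voting on the data used for the search. In practice the weights would be tuned on a held-out validation split, and a class-sensitive fitness such as macro F1 could replace accuracy.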