Abstract

With the continuous growth of the film market and the consumption demand of audiences, the value of film content has become increasingly apparent. Extracting movie content elements is an important step in quantitative analysis of movie content. In this paper, based on text mining technology, deep data analysis of movie reviews is carried out using TF-IDF and machine learning to visualize high-frequency words, and sentiment analysis is performed on reviews to find out the hidden deep information behind the big data of movie reviews. Although the extracted keywords can draw the content and characteristics of the film for us, they cannot establish the correlation with the creative elements of the film and television works. Therefore, in this paper, the extracted keywords are clustered to find the central words of these words. First of all, in order to improve the representativeness of keywords, the Epoch data set and the Batchsize data set are used to conduct experimental analysis on the model in this paper. Through comparative experiments, it is concluded that when Batchsize = 32 and Epoch = 25, the model achieves the optimal classification effect. The analysis shows that when the training times are too small, the model is not fully learned, and when the training times are too many, the model will be overfitted, which will reduce the accuracy of the model, resulting in the opposite effect. The keywords extracted above can draw the content and characteristics of the movie for us.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call