Abstract

The construction of Weibo user profiles in hot events can help to grasp the characteristics of Weibo users involved in the events, which is conductive to the relevant department to strengthen public opinion guidance and propaganda education. Taking the “Viya's tax evasion” case as an example, firstly, the Latent Dirichlet allocation (LDA) topic model is used to construct a topic model ofWeibo content in the case, and the optimal number of topics is determined by perplexity. Then, the k-prototype algorithm and the improved Density-Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm are respectively used to cluster Weibo users and analyze the similarities and differences between the various categories of users. At last, the clustering results of the two algorithms are compared. The experiments show that the topic generation method based on the LDA topic model has a good effect on describing discussion topics. In the process of data containing mixed attributes, the k-Prototype algorithm and the improved DBSCAN algorithm have their respective advantages, and the combined results of the two algorithms can obtain a more complete user group portrait.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.