Abstract

In this paper, based on the clustering analysis method, the author tries to study some celebrities in web blogger groups and adopts unsupervised clustering evaluation methods, which is called silhouette coeffi- cient, to evaluate the classification results of different clustering classification methods. It is concluded that K-means clustering is the best among the clustering methods compared with the traditional classifications. Fur- thermore, it is a dynamic, flexible method and can reduce restrictions of subjective consciousness using cluster analysis. As a result, K-means clustering is universal in web blogger groups' classification process.

Highlights

  • In recent years, the web blog has been developing fast, and it has played an increasingly important role in daily life

  • As for the transmission of information on web blog, Jiang Xin found that the key nodes always act as “opinion leaders”, which make the public opinion disseminate fast on the Internet

  • Ping Liang’s study showed that “opinion leaders” who have significant effects on the transmission of information can guide the public opinion in some degree

Read more

Summary

INTRODUCTION

The web blog has been developing fast, and it has played an increasingly important role in daily life. As for the transmission of information on web blog, Jiang Xin found that the key nodes always act as “opinion leaders”, which make the public opinion disseminate fast on the Internet. Defining these key nodes helps to guide public opinion. In this study about celebrity web blog users, Zhao Yu classified them into two categories (active and realistic celebrity and native celebrity) qualitatively or three categories (information source, opinion leader and initiator of social activities) according to the roles celebrities play. As a foundation of classification, the former method can’t classify web blog users in reality The latter method is limited by the formula people define, and it’s difficult to deal with the formula’s new feature and precise analysis. To find a better clustering method, it uses K-means, Two-steps and Kohonen to do and compares the results of three methods and reputation index

The reputation index
Cluster analysis
Silhouette coefficient
Source of data
Experimental results
-Summary
CONCLUSIONS

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.