Automatic content extraction and time-aware topic clustering for large-scale social network on cloud platform

Chunlin Li,Jingpan Bai

doi:10.1007/s11227-018-2704-z

Abstract

In recent years, with the increase in users in social network, the social network has had the feature of big data. The large-scale social network has become an indispensable part in people’s life. However, the traditional data mining technology cannot suit the large-scale social network. Thus, it is urgent to develop a more suitable mining technology for the large-scale social network. In this section, a crawler model based on semantic analysis and spatial clustering is proposed firstly. Then, the content extraction model based on document object model tree is built to extract the target text information from the links fetched by the proposed crawler model. The similarities between textual information in different regions are computed to choose the important information. Moreover, a two-stage topic clustering model based on time information is presented. The time information is introduced into the similarity computation between two posts or clusters. The single-pass algorithm is improved and applied in different clustering stage to improve the clustering accuracy. Finally, the proposed algorithms are evaluated on Hadoop platform. The Hadoop platform can effectively reduce the computing time and improve the server quality of users in large-scale social network. Meanwhile, the experiments demonstrate that the proposed algorithms are suitable for the data processing in large-scale social network.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Automatic content extraction and time-aware topic clustering for large-scale social network on cloud platform

Abstract

Talk to us

Similar Papers

More From: The Journal of Supercomputing

Lead the way for us

Journal: The Journal of Supercomputing	Publication Date: Nov 26, 2018
Citations: 4

Similar Papers

Inferring Missing Attributes of Users in Large-Scale Social networks
Huadeng Wang ... Xiaonan Luo
-
Huadeng Wang, et. al.Huadeng Wang ... Xiaonan Luo
01 Jun 2019
01 Jun 2019

Finding top-k influential users in social networks under the structural diversity model
Wenzheng Xu ... Jeffrey Xu Yu
Information Sciences | VOL. 355-356
Wenzheng Xu, et. al.Wenzheng Xu ... Jeffrey Xu Yu
24 Mar 2016
Information Sciences | VOL. 355-356

Identifying Influential Users by Improving LeaderRank
Yong Yao ... Cong Ji
-
Yong Yao, et. al.Yong Yao ... Cong Ji
07 Nov 2019
07 Nov 2019

Finding Influential Users in Online Social Networks: A Tree based Approach
...
-
, et. al. ...
06 Feb 2019
06 Feb 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Automatic content extraction and time-aware topic clustering for large-scale social network on cloud platform

Abstract

Talk to us

Similar Papers

More From: The Journal of Supercomputing