Abstract
Network is a powerful language to represent relational data. One way to understand network is to analyze groups of nodes which share same properties or functions. The task of discovering such groups is known as community detection. The community detection in real-life networks, the majority of which are weighted temporal text networks, is confronted with two main problems - how to model the weight of edges and how to exploit the temporal information. Existing works either ignore the edge weight or utilize it in graph measures like modularity, which lacks scalability. And currently the common-used method involving temporal information is to discretize the time, which leads to series of problems. We are thus motivated to present a new method to encode the edge weight and temporal information. A probabilistic generative model, named Custom Temporal Community Detection (CTCD) is introduced, which views the link between two nodes as a weighted edge with several time stamps. Our model utilizes network, semantic and temporal information simultaneously to extract temporal community affiliations for individual user, influence strength across communities and temporal interested topic in each community. An efficient inference method, which scales linearly, and corresponding parallel implementation are proposed to adapt to large datasets. Through the knowledge extracted by CTCD, we are able to spot the community shift of the individual user, to which little attention has been given, and employ it to track the development of the communities over time. Moreover, experiments on two large-scale weighted temporal text networks show that CTCD gains significant improvement over state-of-the-art methods on a series of tasks.
Highlights
From blogs to video-sharing sites to social networks and still others, online social media have experienced rapid growth over the past half-decade [2]
A probabilistic generative model based on Bayesian network, named Custom Temporal Community Detection (CTCD), is introduced
MODEL STRUCTURE To integrate the text, time and network information into a unified framework, we propose the probabilistic generative model, CTCD (Custom Temporal Community Detection)
Summary
From blogs to video-sharing sites to social networks and still others, online social media have experienced rapid growth over the past half-decade [2]. A probabilistic generative model based on Bayesian network, named Custom Temporal Community Detection (CTCD), is introduced. Post time stamps and interaction time stamps are generated by CTCD Taking both weight information and temporal information in to account, CTCD is able to extract temporal community affiliations for each user, the influence strength across communities and temporal interested topics in each community at the same time. By modeling time in this way, VOLUME 8, 2020 our model is able to extract temporal community affiliation strength for each user, and avoids problems brought by time discretization in existing works. To simplify the analysis or design, they are often formulated as two-value networks (1 denotes edge existence, otherwise 0) in previous community detection works [7], [11], [12], which leads to loss of information. Model, CTCD-Ber, we can get a view of the improvement brought by taking edge weight into account
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.