Abstract

Link prediction is a fundamental problem of data science, which usually calls for unfolding the mechanisms that govern the micro-dynamics of networks. In this regard, using features obtained from network embedding for predicting links has drawn widespread attention. Although methods based on edge features or node similarity have been proposed to solve the link prediction problem, many technical challenges still exist due to the unique structural properties of networks, especially when the networks are sparse. From the graph mining perspective, we first give empirical evidence of the inconsistency between heuristic and learned edge features. Then, we propose a novel link prediction framework, AdaSim, by introducing an Adaptive Similarity function using features obtained from network embedding based on random walks. The node feature representations are obtained by optimizing a graph-based objective function. Instead of generating edge features using binary operators, we perform link prediction solely leveraging the node features of the network. We define a flexible similarity function with one tunable parameter, which serves as a penalty of the original similarity measure. The optimal value is learned through supervised learning and thus is adaptive to data distribution. To evaluate the performance of our proposed algorithm, we conduct extensive experiments on eleven disparate networks of the real world. Experimental results show that AdaSim achieves better performance than state-of-the-art algorithms and is robust to different sparsities of the networks.

Highlights

  • Networks have recently emerged as an important tool for representing and analyzing many kinds of interacting systems ranging from biological to social science [1]

  • (iii) We demonstrate the effectiveness of AdaSim by conducting experiments on various disparate networks of the real-world. e results show that the proposed method can boost the performance of link prediction in different degrees

  • In order to obtain the following results, we set the parameters in line with the typical values in [26]. at is, d 128, k 10, l 80, λ 10, and the optimization is run for a single epoch

Read more

Summary

Introduction

Networks have recently emerged as an important tool for representing and analyzing many kinds of interacting systems ranging from biological to social science [1]. In social networks like Facebook, only part of the friendships among users are shown by the observed network, and there still exist user pairs who already know each other but are not connected through Facebook Due to this, it is always a challenging yet meaningful task to identify which pairs of nodes not connected in the current network are likely to be connected in the actual network, i.e., predicting missing links. It is always a challenging yet meaningful task to identify which pairs of nodes not connected in the current network are likely to be connected in the actual network, i.e., predicting missing links Acquiring such knowledge is Complexity useful, for example, in biological domain, it gives invaluable guidance to carry out targeted experiments, and in social network domain, it can be used to recommend promising friendships, enhancing users’ loyalties to web services

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call