The Network Representation Learning Algorithm Based on Semi-Supervised Random Walk

Dong Liu,Qinpeng Li,Yan Ru,Jun Zhang

doi:10.1109/access.2020.3044367

Dong Liu, Qinpeng Li + Show 2 more

Open Access

PDF Available

https://doi.org/10.1109/access.2020.3044367

Copy DOI

Export

Save

Cite

Journal: IEEE Access	Publication Date: Jan 1, 2020
Citations: 5	License type: CC BY 4.0

Affiliation: Henan Normal University, Zhengzhou University

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

As an important tool of social network analysis, network representation learning also called network embedding maps the network to a latent space and learns low-dimensional and dense real vectors of nodes, while preserving the structure and internal attributes of network. The learned representations or embedding vectors can be used for node clustering, link prediction, network visualization and other tasks for network analysis. Most of the existing network representation learning algorithms mainly focus on the preservation of micro or macro network structure, ignoring the mesoscopic community structure information. Although a few network embedding methods are proposed to preserve the community structure, they all ignore the prior information about communities. Inspired by the semi-supervised community detection in complex networks, in this article, a novel Semi-Supervised DeepWalk method(SSDW) is proposed for network representation learning, which successfully preserves the community structure of network in the embedding space. Specifically, a semi-supervised random walk sampling method which effectively integrates the pairwise constraints is proposed. By doing so, the SSDW model can guide the transition probability in the random walk process and obtain the node context sequence in line with the prior knowledge. The experimental results on eight real networks show that comparing with the popular network embedding methods, the node representation vectors integrating pairwise constraints into the random walk process can obtain higher accuracy on node clustering task, and the results of link prediction, network visualization tasks indicate that the semi-supervised model SSDW is more discriminative than unsupervised ones.

Highlights

Complex network analysis is of great significance to study different systems, to explore the complex relationships among social networks, and to discover the attributes and contents of academic networks
FRAMEWORK This paper proposes a network representation learning model SSDW based on semi-supervised random walk
Through the community structure information reflected by pairwise constraints, the neighborhood structure of nodes is modified, and the sampling probability of the node is changed, which affects the final learned node representation vectors

Summary

Introduction

Complex network analysis is of great significance to study different systems, to explore the complex relationships among social networks, and to discover the attributes and contents of academic networks. With the exponential growth of internet data scale, the relationship between nodes in networks becomes more and more complex. The traditional network analysis technologies are difficult to adapt to large-scale data mining and machine learning tasks [1]. Called network embedding, can preserve the structure and internal attributes of the network and learn effective low-dimensional representation vectors for nodes. The obtained node representation vectors can be used as node features to carry out subsequent network analysis tasks, such as community detection, node classification, link prediction, network visualization, and so on [2]. Line [4] describes the first-order and second-order similarity of nodes with two different functions

Methods

Results

Conclusion