Abstract
Graph embedding aims to represent node structural as well as attribute information into a low-dimensional vector space so that some downstream application tasks such as node classification, link prediction, community detection, and recommendation can be easily performed by using simple machine learning algorithms. The graph convolutional network is a neural network framework for machine learning on graphs. Because of its powerful ability to model graph data, it is currently the best choice for graph embedding. However, most existing graph convolutional network-based embedding algorithms not only ignore the data distribution of the latent codes but also lose the high-order proximity between nodes in a graph, leading to inferior embedding. To mitigate this problem, we investigate how to enforce latent codes to match a prior distribution, and we introduce random walk to preserve high-order proximity in a graph. In this paper, we propose a novel graph embedding framework, Adversarial and Random Walk Regularized Graph Embedding (ARWR-GE), which jointly preserves structural and attribute information. ARWR-GE adopts an adversarial training scheme to enforce the latent codes to match a prior distribution, and by employing the skip-gram model, nodes in a random walk sequence are closer in the latent space. We evaluate our proposed framework by using three real-world datasets on link prediction, graph clustering, and visualization tasks. The results demonstrate that our framework achieves better performance than state-of-the-art graph embedding algorithms.
Highlights
Graphs are universal data structures that can represent complex relational data that ubiquitously exist in the real world, including social networks [1], [2], paper citation networks [3], [4], protein-protein interaction networks [5], [6], etc
OUR MODEL Combining the advantages of the above components, we propose a graph embedding framework based on the graph autoencoder: Adversarial and Random walk Regularized Graph Embedding(ARWR-GE), which can transform high-dimensional complex graph data into a robust, lowdimensional latent representation that preserves the graph structure and attribute information
A general observation that we can draw from Fig. (2) is that ARWR-GE and ARWR-VGE consistently obtain the best AUC results on the three datasets
Summary
Graphs are universal data structures that can represent complex relational data that ubiquitously exist in the real world, including social networks [1], [2], paper citation networks [3], [4], protein-protein interaction networks [5], [6], etc. The high computational complexity, low parallelizability, and inapplicability of machine learning methods to graph data have made traditional graph analysis algorithms including shortest path, centrality measurement, etc., profoundly challenging. The associate editor coordinating the review of this manuscript and approving it for publication was Jing Bi. space to low-dimensional vector space while encoding structural and semantic information. Because of the universality of the embedding vectors, graph embedding technology can be applied to many fields and tasks such as social networks and recommender systems [14] by using the off-the-shelf machine learning method
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have