Abstract

Computing the similarity between two nodes is one of the important tasks on a graph; link-based similarity measures (in short, similarity measures) are well-known, conventional techniques for this task that exploit the relations between nodes (i.e., links) in the graph. Graph embedding methods (in short, embedding methods) convert the nodes of a graph into vectors in a low-dimensional space while preserving the social relations among nodes in the original graph. Instead of applying a similarity measure to the graph to compute the similarity between nodes a and b, we can treat the proximity between the vectors of a and b obtained by an embedding method as their similarity. Although embedding methods have been analyzed in a wide range of machine learning tasks such as link prediction and node classification, they have not been investigated in terms of node similarity computation. In this paper, we investigate both the effectiveness and the efficiency of embedding methods in the task of node similarity computation by comparing them with those of similarity measures. To the best of our knowledge, this is the first work that examines the application of embedding methods to this specific task. Based on the results of our extensive experiments on five well-known, publicly available datasets, we make the following observations about embedding methods: (1) they are less effective than similarity measures on all datasets except one; (2) they are less efficient than similarity measures on all datasets except one; (3) they have more parameters than similarity measures, which leads to a time-consuming parameter tuning process; and (4) increasing the number of dimensions does not necessarily improve their effectiveness in computing the similarity of nodes.
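To make the setting concrete, the following is a minimal sketch (not the paper's code) contrasting the two approaches on a toy graph: a link-based similarity measure (SimRank, as implemented in NetworkX) applied directly to the graph structure, versus the cosine proximity of two nodes' embedding vectors. Since the abstract does not fix a particular embedding method, the embedding is replaced here by a random low-dimensional placeholder; in practice it would come from a method such as DeepWalk or node2vec.

```python
import numpy as np
import networkx as nx

# Toy graph standing in for one of the datasets.
G = nx.karate_club_graph()
a, b = 0, 33

# Link-based similarity measure: SimRank applied directly to the graph.
simrank_ab = nx.simrank_similarity(G, source=a, target=b)

# Embedding-based similarity: proximity (here, cosine) of the two nodes' vectors.
# A random d-dimensional placeholder stands in for a real embedding method.
d = 16
rng = np.random.default_rng(0)
embeddings = {v: rng.normal(size=d) for v in G.nodes()}

def cosine(u, v):
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

embedding_ab = cosine(embeddings[a], embeddings[b])

print(f"SimRank(a, b)    = {simrank_ab:.4f}")
print(f"cosine(z_a, z_b) = {embedding_ab:.4f}")
```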

Highlights

  • Nowadays, graphs are becoming increasingly important since they are natural representations to encode relational structures in many domains, where nodes represent the domain’s objects and links represent their pairwise relationships [1,2,3,4,5,6,7]

  • In Section 4.2.2, for each dataset, we find the best values of d for which the embedding methods show their highest accuracy in similarity computation of nodes (a sketch of this parameter search follows the list)

  • We apply the similarity measures to our five datasets over eight iterations; for each similarity measure and dataset, we identify the best iteration, on which the similarity measure shows its highest accuracy

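The following is a hedged sketch of the parameter search referenced above: for each candidate number of dimensions d, build an embedding of the dataset and keep the d that gives the highest accuracy in node similarity computation. Both the embedding step and the accuracy metric are hypothetical placeholders (a random projection and a toy adjacency-correlation score), included only so the loop runs end to end; they are not the paper's methods or evaluation. The same selection loop applies to finding the best iteration for each similarity measure.

```python
import numpy as np
import networkx as nx

G = nx.karate_club_graph()        # stands in for one of the five datasets
A = nx.to_numpy_array(G)          # adjacency matrix

def toy_embedding(adjacency, d, seed=0):
    # Placeholder embedding: random projection of adjacency rows into d dimensions.
    rng = np.random.default_rng(seed)
    return adjacency @ rng.normal(size=(adjacency.shape[1], d))

def toy_accuracy(Z):
    # Placeholder metric: correlation between cosine similarities of the
    # embedding vectors and direct adjacency (not the paper's accuracy measure).
    unit = Z / np.linalg.norm(Z, axis=1, keepdims=True)
    cos = unit @ unit.T
    return float(np.corrcoef(cos.ravel(), A.ravel())[0, 1])

# Sweep candidate dimensionalities and keep the best-scoring d.
best_d = max([16, 32, 64, 128], key=lambda d: toy_accuracy(toy_embedding(A, d)))
print("best d:", best_d)
```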

Introduction

Graphs are becoming increasingly important since they are natural representations to encode relational structures in many domains (e.g., apps’ function-call diagrams, brain-region functional activities, bio-medical drug molecules, protein interaction networks, citation networks, and social networks), where nodes represent the domain’s objects and links represent their pairwise relationships [1,2,3,4,5,6,7]. Computing the similarity score between two nodes based on the graph structure is a fundamental task in a wide range of applications such as recommender systems, spam detection, graph clustering [8,9], web page ranking, citation analysis, social network analysis, k-nearest neighbor search [1,9], synonym expansion (i.e., search engines’ query rewriting and text simplification), and lexicon extraction (i.e., automatically building bilingual lexicons from text corpora) [10].
