Abstract

<i>Heterogeneous information networks (HINs)</i> , which are typed graphs with labeled nodes and edges, have attracted tremendous interest from academia and industry. Given two HIN nodes <inline-formula><tex-math notation="LaTeX">$s$</tex-math></inline-formula> and <inline-formula><tex-math notation="LaTeX">$t$</tex-math></inline-formula> , and a natural number <inline-formula><tex-math notation="LaTeX">$k$</tex-math></inline-formula> , we study the discovery of the <inline-formula><tex-math notation="LaTeX">$k$</tex-math></inline-formula> most important meta paths in real time, which can be used to support friend search, product recommendation, anomaly detection, and graph clustering. In this work, we argue that the shortest path between <inline-formula><tex-math notation="LaTeX">$s$</tex-math></inline-formula> and <inline-formula><tex-math notation="LaTeX">$t$</tex-math></inline-formula> may not necessarily be the most important path. As such, we combine several ranking functions, which are based on <i>frequency</i> and <i>rarity</i> , to redefine the unified importance function of the meta paths between <inline-formula><tex-math notation="LaTeX">$s$</tex-math></inline-formula> and <inline-formula><tex-math notation="LaTeX">$t$</tex-math></inline-formula> . Although this importance function can capture more information, it is very time-consuming to find top- <inline-formula><tex-math notation="LaTeX">$k$</tex-math></inline-formula> meta paths using this importance function. Therefore, we integrate this importance function into a multi-step framework, which can efficiently filter some impossible meta paths between <inline-formula><tex-math notation="LaTeX">$s$</tex-math></inline-formula> and <inline-formula><tex-math notation="LaTeX">$t$</tex-math></inline-formula> . In addition, we combine bidirectional searching algorithm with this framework to further boost the efficiency performance. The experiment on different datasets shows that our proposed method outperforms state-of-the-art algorithms in terms of effectiveness with reasonable response time.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call