Minimized-cost cube query on heterogeneous information networks

Dan Yin,Zhaonian Zou,Hong Gao,Jianzhong Li

doi:10.1007/s10878-015-9967-6

Abstract

Data cube is the foundation of on-line analytical processing (OLAP), which can provide users with data views from different perspectives and granularities. Heterogeneous information networks consist of multiple types of nodes and edges which represent different semantic relations. With the rapid development of social networks and knowledge graphs, heterogeneous information networks have become increasingly popular. In heterogeneous information networks, cube is the set of aggregate graphs and cube query is required for supporting OLAP. The existing research mainly studies aggregate graph query on homogeneous networks, but only considers the attributes of nodes. To overcome these challenges, this paper investigates cube query problem on heterogeneous information networks. (1) A novel cube model for heterogeneous information networks is proposed, which captures both the attribute and structure semantics. (2) Because the total number of aggregate graphs is huge, computing and storing them cost plenty of time and storage. The problem of partial cube materialization on heterogeneous information networks is investigated. Given a fixed size of memory space, select a subset of aggregate graphs in cube, to minimize the computing cost of the whole cube. This optimization problem is proved to be NP-complete and there is no $$n^{1-{\epsilon }}$$n1-∈ approximation algorithm unless P $$=$$= NP. (3) A greedy algorithm is proposed for partial cube materialization based on two interesting dependencies between aggregate graphs, attribute dependence and path dependence. (4) Experiments on real world data sets show the cube definition is meaningful, and the partial cube materialization algorithm is efficient.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Minimized-cost cube query on heterogeneous information networks

Abstract

Talk to us

Similar Papers

More From: Journal of Combinatorial Optimization

Lead the way for us

Journal: Journal of Combinatorial Optimization	Publication Date: Oct 22, 2015
Citations: 1

Similar Papers

Mining heterogeneous information networks
Jaiwei Han
-
Jaiwei HanJaiwei Han
12 Aug 2012
12 Aug 2012

Mining Heterogeneous Information Networks: Principles and Methodologies
Yizhou Sun ... Jiawei Han
Synthesis Lectures on Data Mining and Knowledge Discovery | VOL. 3
Yizhou Sun, et. al.Yizhou Sun ... Jiawei Han
18 Jul 2012
Synthesis Lectures on Data Mining and Knowledge Discovery | VOL. 3

Mobile APP User Attribute Prediction by Heterogeneous Information Network Modeling
Hekai Zhang ... Linfeng Du
-
Hekai Zhang, et. al.Hekai Zhang ... Linfeng Du
01 Jan 2019
01 Jan 2019

MAHE-IM: Multiple Aggregation of Heterogeneous Relation Embedding for Influence Maximization on Heterogeneous Information Network
Ying Li ... Qianqian Li
Expert Systems With Applications | VOL. 202
Ying Li, et. al.Ying Li ... Qianqian Li
26 Apr 2022
Expert Systems With Applications | VOL. 202

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Minimized-cost cube query on heterogeneous information networks

Abstract

Talk to us

Similar Papers

More From: Journal of Combinatorial Optimization