Finding the most descriptive substructures in graphs with discrete and numeric labels

Michael Davis,Paul Miller,Weiru Liu

doi:10.1007/s10844-013-0299-7

Abstract

Many graph datasets are labelled with discrete and numeric attributes. Most frequent substructure discovery algorithms ignore numeric attributes; in this paper we show how they can be used to improve search performance and discrimination. Our thesis is that the most descriptive substructures are those which are normative both in terms of their structure and in terms of their numeric values. We explore the relationship between graph structure and the distribution of attribute values and propose an outlier-detection step, which is used as a constraint during substructure discovery. By pruning anomalous vertices and edges, more weight is given to the most descriptive substructures. Our method is applicable to multi-dimensional numeric attributes; we outline how it can be extended for high-dimensional data. We support our findings with experiments on transaction graphs and single large graphs from the domains of physical building security and digital forensics, measuring the effect on runtime, memory requirements and coverage of discovered patterns, relative to the unconstrained approach.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Finding the most descriptive substructures in graphs with discrete and numeric labels

Abstract

Talk to us

Similar Papers

More From: Journal of Intelligent Information Systems

Lead the way for us

Journal: Journal of Intelligent Information Systems	Publication Date: Dec 27, 2013
Citations: 2

Similar Papers

Finding the Most Descriptive Substructures in Graphs with Discrete and Numeric Labels
Michael Davis ... Paul Miller
-
Michael Davis, et. al.Michael Davis ... Paul Miller
01 Jan 2013
01 Jan 2013

CCGraMi: An Effective Method for Mining Frequent Subgraphs in a Single Large Graph
Lam B Q Nguyen ... Ivan Zelinka
MENDEL | VOL. 27
Lam B Q Nguyen, et. al.Lam B Q Nguyen ... Ivan Zelinka
21 Dec 2021
MENDEL | VOL. 27

Fast and scalable algorithms for mining subgraphs in a single large graph
Lam B.Q Nguyen ... Ivan Zelinka
Engineering Applications of Artificial Intelligence | VOL. 90
Lam B.Q Nguyen, et. al.Lam B.Q Nguyen ... Ivan Zelinka
24 Feb 2020
Engineering Applications of Artificial Intelligence | VOL. 90

Mining Association Rules from a Single Large Graph
Bao Huynh ... Bay Vo
Cybernetics and Systems | VOL. 55
Bao Huynh, et. al.Bao Huynh ... Bay Vo
24 Dec 2022
Cybernetics and Systems | VOL. 55

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Finding the most descriptive substructures in graphs with discrete and numeric labels

Abstract

Talk to us

Similar Papers

More From: Journal of Intelligent Information Systems