Anti-monotone Property Research Articles

Keyword search has been widely studied as a way to retrieve relevant substructures from graphs for a given set of keywords. However, existing approaches aim at finding compact trees or subgraphs containing the keywords and ignore a critical measure, density, which represents how strongly and stably the keyword nodes are connected in the substructure. In this paper, given a set of keywords $Q = \lbrace w_1, w_2, \ldots, w_l\rbrace$, we study the problem of finding a cohesive subgraph containing $Q$ with high density and compactness from a graph $G$. We model the cohesive subgraph based on a carefully chosen $k$-truss model, and formulate the problem of finding cohesive subgraphs for keyword queries as the minimal dense truss search problem, i.e., finding a minimal subgraph that covers $Q$ and maximizes the trussness. However, unlike $k$-truss based community search, which can be done efficiently by local search from a given set of nodes, minimal dense truss search for keyword queries is a nontrivial task because the subset of keyword nodes to be included in the retrieved substructure is not known in advance. To tackle this problem, we first design a novel hybrid KT-Index to keep the keyword and truss information compactly, and then propose an efficient algorithm that carries out the search on KT-Index directly to find the dense truss with the maximum trussness, $G_{den}$, without repeated accesses to the original graph. Then, we develop a novel refinement approach to extract the minimal dense truss from the dense truss $G_{den}$ by checking each node at most once based on the anti-monotonicity property derived from $k$-truss, together with several optimization strategies including batch-based deletion, early-stop-based deletion, and local exploration. Moreover, we also extend the proposed method to deal with top-$r$ search. Extensive experimental studies on real-world networks validated the effectiveness and efficiency of our approaches.
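For readers unfamiliar with the $k$-truss model, the following is a minimal sketch of the standard definition only: a $k$-truss is the largest subgraph in which every edge lies in at least $k - 2$ triangles. It is not the KT-Index or the refinement algorithm described in the abstract; the function name `k_truss` and the edge-list input format are assumptions made for illustration. The sketch also shows the well-known nesting of trusses (the $(k+1)$-truss is contained in the $k$-truss), which is the kind of structure that anti-monotone pruning can exploit.

```python
def k_truss(edges, k):
    """Return the edges of the k-truss of an undirected simple graph.

    Illustrative peeling procedure (an assumption, not the paper's method):
    repeatedly delete any edge that lies in fewer than k - 2 triangles.
    """
    adj = {}
    for u, v in edges:
        if u != v:  # ignore self-loops
            adj.setdefault(u, set()).add(v)
            adj.setdefault(v, set()).add(u)

    # Work list of edges whose triangle count still needs checking.
    queue = [(u, v) for u in adj for v in adj[u]]
    while queue:
        u, v = queue.pop()
        if v not in adj[u]:
            continue  # edge was already deleted
        common = adj[u] & adj[v]  # each common neighbour closes one triangle
        if len(common) < k - 2:
            adj[u].discard(v)
            adj[v].discard(u)
            # Edges that shared a triangle with (u, v) must be re-checked.
            for w in common:
                queue.append((u, w))
                queue.append((v, w))

    return {frozenset((u, v)) for u in adj for v in adj[u]}


# A 4-clique on nodes 1..4 with one pendant edge (4, 5).
example_edges = [(1, 2), (1, 3), (1, 4), (2, 3), (2, 4), (3, 4), (4, 5)]
truss_3 = k_truss(example_edges, 3)
truss_4 = k_truss(example_edges, 4)
truss_5 = k_truss(example_edges, 5)
```

In this toy graph the pendant edge belongs to no triangle, so it drops out at $k = 3$, while the clique edges survive up to $k = 4$ and vanish at $k = 5$.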


Since outliers are the major factors that affect accuracy in data science, many outlier detection approaches have been proposed for effectively identifying the implicit outliers in static datasets, thereby improving the reliability of the data. In recent years, data streams have become the main form of data, and the data elements in a data stream are not always of equal importance. However, the existing outlier detection approaches do not take element weights into account; hence, these methods are not suitable for processing weighted data streams. In addition, the traditional pattern-based outlier detection approaches incur a high time cost in the outlier detection phase. To overcome these problems, this paper proposes a two-phase pattern-based outlier detection approach, namely, WMFP-Outlier, for effectively detecting the implicit outliers in a weighted data stream, in which the maximal frequent patterns are used instead of the frequent patterns to accelerate the process of outlier detection. In the process of maximal frequent-pattern mining, the anti-monotonicity property and the MFP-array structure are used to accelerate the mining operation. In the process of outlier detection, three deviation indices are designed for measuring the degree of abnormality of each transaction, and the transactions with the highest degrees of abnormality are judged as outliers. Finally, several experimental studies are conducted on a synthetic dataset to evaluate the performance of the proposed WMFP-Outlier approach. The results demonstrate that the accuracy of the WMFP-Outlier approach is higher than that of the existing pattern-based outlier detection approaches, and that the time cost of the outlier detection phase of WMFP-Outlier is lower than those of the other four compared pattern-based outlier detection approaches.
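As a rough illustration of how the anti-monotonicity property speeds up pattern mining, the sketch below uses a plain Apriori-style level-wise search on unweighted transactions: if an itemset is infrequent, none of its supersets can be frequent, so candidates with an infrequent subset are discarded before their support is counted. This is not WMFP-Outlier; the weighting scheme, the MFP-array structure, and the deviation indices are not reproduced, and the names `frequent_itemsets` and `maximal_patterns` are hypothetical.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Level-wise mining with anti-monotone pruning (unweighted sketch)."""
    transactions = [frozenset(t) for t in transactions]

    def support(itemset):
        # Number of transactions containing every item of the itemset.
        return sum(1 for t in transactions if itemset <= t)

    items = {item for t in transactions for item in t}
    level = {frozenset([i]) for i in items
             if support(frozenset([i])) >= min_support}
    frequent = set(level)
    size = 1
    while level:
        size += 1
        # Join frequent (size-1)-itemsets into size-itemset candidates.
        candidates = {a | b for a in level for b in level
                      if len(a | b) == size}
        # Anti-monotone pruning: keep a candidate only if all of its
        # (size-1)-subsets are frequent; otherwise it cannot be frequent.
        candidates = {c for c in candidates
                      if all(frozenset(s) in level
                             for s in combinations(c, size - 1))}
        level = {c for c in candidates if support(c) >= min_support}
        frequent |= level
    return frequent


def maximal_patterns(frequent):
    """Frequent itemsets that have no frequent proper superset."""
    return {p for p in frequent if not any(p < q for q in frequent)}


example = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}]
patterns = frequent_itemsets(example, min_support=3)
maximal = maximal_patterns(patterns)
```

With a minimum support of 3, the single items and all three pairs are frequent, while the full triple appears in only two transactions, so the maximal frequent patterns are the three pairs. Working with maximal patterns keeps the pattern set small, which is consistent with the abstract's motivation for preferring them over all frequent patterns.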


Articles published on Anti-monotone Property

Mining Spatial Co-Location Patterns With a Mixed Prevalence Measure

Totally-ordered Sequential Rules for Utility Maximization

Effective Community Search on Large Attributed Bipartite Graphs

SCPM-CR: A Novel Method for Spatial Co-Location Pattern Mining With Coupling Relation Consideration

Discovering Periodic Patterns in Time Series from Twitter Data Set

Effective community search over large star-schema heterogeneous information networks

A chaotic system with attractor coexistence and its synchronization circuit implementation

Finding Route Hotspots in Large Labeled Networks

Efficient discovery of co-location patterns from massive spatial datasets with or without rare features

Efficient list based mining of high average utility patterns with maximum average pruning strategies

Route to Chaos in an Electrostatic Ion Cyclotron with Higher-Order Source Term

Fracmemristor chaotic oscillator with multistable and antimonotonicity properties

UWFP-Outlier: an efficient frequent-pattern-based outlier detection method for uncertain weighted data streams

Cohesive Subgraph Search Using Keywords in Large Networks

Efficient Discovery of Weighted Frequent Neighborhood Itemsets in Very Large Spatiotemporal Databases

WMFP-Outlier: An Efficient Maximal Frequent-Pattern-Based Outlier Detection Approach for Weighted Data Streams

Antimonotonicity and multistability in a fractional order memristive chaotic oscillator

Efficiently mining cohesion-based patterns and rules in event sequences

Efficiently Approximating Top-$k$ Sequential Patterns in Transactional Graphs

Selective Database Projections Based Approach for Mining High-Utility Itemsets
