Efficiently Mining Frequent Embedded Unordered Trees

Mohammed J Zaki

doi:10.5555/1227174.1227177

Efficiently Mining Frequent Embedded Unordered Trees

Mohammed J Zaki

https://doi.org/10.5555/1227174.1227177

Copy DOI

Journal: Fundamenta Informaticae	Publication Date: Nov 1, 2004
Citations: 94

Affiliation: Rensselaer Polytechnic Institute

#Unordered Trees #Mining Frequent Trees + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Mining frequent trees is very useful in domains like bioinformatics, web mining, mining semi-structured data, and so on. In this paper we introduce SLEUTH, an efficient algorithm for mining frequent, unordered, embedded subtrees in a database of labeled trees. The key contributions of our work are as follows: We give the first algorithm that enumerates all embedded, unordered trees. We propose a new equivalence class extension scheme to generate all candidate trees. We extend the notion of scope-list joins to compute frequency of unordered trees. We conduct performance evaluation on several synthetic and real datasets to show that SLEUTH is an efficient algorithm, which has performance comparable to TreeMiner, that mines only ordered trees.

Full Text