A General Probabilistic Framework for Mining Labeled Ordered Trees

Nobuhisa Ueda, Hiroshi Mamitsuka, Kiyoko F Aoki

doi:10.1137/1.9781611972740.33

Abstract

We propose a new probabilistic model for mining labeled ordered trees. Its distinguishing feature is that it accounts for the order of siblings by modeling the dependency of each node on its elder sibling as well as on its parent. The model is a natural extension of a variety of existing probabilistic models for strings and trees. We further propose a new learning/mining method, based on an EM algorithm, for estimating the parameters of the model; this method likewise extends those for various simpler probabilistic models, such as hidden Markov models and hidden tree Markov models. We evaluated the effectiveness of the proposed method on both synthetic and real-world data sets, comparing the results with those of several simpler probabilistic models. In the experiments, the proposed method outperformed the other methods compared, and the differences were statistically significant in all cases tested. This result indicates that the proposed methodology is highly effective for mining labeled ordered trees, which have recently emerged as a typical data structure in numerous data mining domains, including the web, text mining, and bioinformatics.
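The dependency structure described in the abstract (each node conditioned on its parent and its elder sibling) can be illustrated with a small sketch. This is not the authors' implementation: the `Node` class, the `dependency_contexts` function, and the reading of "elder sibling" as the immediately preceding sibling are assumptions made purely for illustration. The sketch only lists, for every node of a labeled ordered tree, the two nodes it would depend on; it does not estimate any probabilities or run EM.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Node:
    """Hypothetical labeled ordered tree node; children are kept in sibling order."""
    label: str
    children: List["Node"] = field(default_factory=list)


def dependency_contexts(
    root: Node,
) -> List[Tuple[str, Optional[str], Optional[str]]]:
    """Return (label, parent label, elder-sibling label) for each node in preorder.

    Assumption: "elder sibling" means the immediately preceding sibling.
    The root has no parent, and a first child has no elder sibling (None).
    """
    out: List[Tuple[str, Optional[str], Optional[str]]] = []

    def visit(node: Node, parent: Optional[Node], elder: Optional[Node]) -> None:
        out.append(
            (
                node.label,
                parent.label if parent is not None else None,
                elder.label if elder is not None else None,
            )
        )
        previous: Optional[Node] = None
        for child in node.children:
            visit(child, node, previous)
            previous = child  # this child is the elder sibling of the next one

    visit(root, None, None)
    return out


# Example tree: a has ordered children b and c; b has a single child d.
tree = Node("a", [Node("b", [Node("d")]), Node("c")])
contexts = dependency_contexts(tree)
for label, parent_label, elder_label in contexts:
    print(label, parent_label, elder_label)
```

In this example, node `c` is the only node with both a parent (`a`) and an elder sibling (`b`), which is the case where a model that looks only at the parent would discard sibling-order information.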

