Mining significant tree patterns in carbohydrate sugar chains

K Hashimoto,I Takigawa,M Shiga,M Kanehisa,H Mamitsuka

doi:10.1093/bioinformatics/btn293

Abstract

Carbohydrate sugar chains or glycans, the third major class of macromolecules, hold branch shaped tree structures. Glycan motifs are known to be two types: (1) conserved patterns called 'cores' containing the root and (2) ubiquitous motifs which appear in external parts including leaves and are distributed over different glycan classes. Finding these glycan tree motifs is an important issue, but there have been no computational methods to capture these motifs efficiently. We have developed an efficient method for mining motifs or significant subtrees from glycans. The key contribution of this method is: (1) to have proposed a new concept, 'á-closed frequent subtrees', and an efficient method for mining all these subtrees from given trees and (2) to have proposed to apply statistical hypothesis testing to rerank the frequent subtrees in significance. We experimentally verified the effectiveness of the proposed method using real glycans: (1)We examined the top 10 subtrees obtained by our method at some parameter setting and confirmed that all subtrees are significant motifs in glycobiology. (2) We applied the results of our method to a classification problem and found that our method outperformed other competing methods, SVM with three different tree kernels, being all statistically significant. Supplementary data are available at Bioinformatics online.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Mining significant tree patterns in carbohydrate sugar chains

Abstract

Talk to us

Similar Papers

More From: Bioinformatics

Lead the way for us

Journal: Bioinformatics	Publication Date: Aug 9, 2008
Citations: 42

Similar Papers

Mining Frequent Subtrees in Glycan Data Using the Rings Glycan Miner Tool
Kiyoko Flora Aoki-Kinoshita
-
Kiyoko Flora Aoki-KinoshitaKiyoko Flora Aoki-Kinoshita
08 Sep 2012
08 Sep 2012

Guest Editors' Introduction: Special Issue on Mining Biological Data
Wei Wang ... Jiong Yang
IEEE Transactions on Knowledge and Data Engineering | VOL. 17
Wei Wang, et. al. Wei Wang ... Jiong Yang
01 Aug 2005
IEEE Transactions on Knowledge and Data Engineering | VOL. 17

A probabilistic model for mining labeled ordered trees: capturing patterns in carbohydrate sugar chains
N Ueda ... H Mamitsuka
IEEE Transactions on Knowledge and Data Engineering | VOL. 17
N Ueda, et. al.N Ueda ... H Mamitsuka
01 Aug 2005
IEEE Transactions on Knowledge and Data Engineering | VOL. 17

A Simple Yet Efficient Approach for Maximal Frequent Subtrees Extraction from a Collection of XML Documents
Juryon Paik ... Ung Mo Kim
-
Juryon Paik, et. al.Juryon Paik ... Ung Mo Kim
01 Jan 2006
01 Jan 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Mining significant tree patterns in carbohydrate sugar chains

Abstract

Talk to us

Similar Papers

More From: Bioinformatics