Abstract

Exploratory analysis of high throughput gene sample time (GST) data has an impact in biomedical and bioinformatics research. Mining gene expression pattern in such three dimensional data facilitate in understanding hidden biological knowledge as well as underlying complex gene regulatory mechanism. In particular, we propose a novel semi-supervised Pathway-based Order Preserving Triclustering (POPTric) algorithm which records gene expression pattern under subset of sample across subset of time points. Nowadays, integration of external knowledge has gaining popularity in the field of gene expression data analysis. Our algorithm is able to identify different types of triclusters such as additive, multiplicative, and additive-and-multiplicative triclusters. POPTric at first identifies biclusters and then it discovers triclusters from the 3D data. We apply our algorithm in artificial datasets as well as breast cancer, HIV datasets, and compare with other state-of-the-art methods. To establish the biological significance of proposed algorithm we perform Gene Ontology enrichment analysis. We also identify hub genes from the tricluster which are previously known to be associated with breast cancer.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call