Hierarchical Maximum Likelihood Clustering Approach.

Alok Sharma,Michiaki Kubo,Yoichiro Kamatani,Tatsuhiko Tsunoda,Daichi Shigemizu,Keith A Boroevich

doi:10.1109/tbme.2016.2542212

Abstract

In this paper, we focused on developing a clustering approach for biological data. In many biological analyses, such as multiomics data analysis and genome-wide association studies analysis, it is crucial to find groups of data belonging to subtypes of diseases or tumors. Conventionally, the k-means clustering algorithm is overwhelmingly applied in many areas including biological sciences. There are, however, several alternative clustering algorithms that can be applied, including support vector clustering. In this paper, taking into consideration the nature of biological data, we propose a maximum likelihood clustering scheme based on a hierarchical framework. This method can perform clustering even when the data belonging to different groups overlap. It can also perform clustering when the number of samples is lower than the data dimensionality. The proposed scheme is free from selecting initial settings to begin the search process. In addition, it does not require the computation of the first and second derivative of likelihood functions, as is required by many other maximum likelihood-based methods. This algorithm uses distribution and centroid information to cluster a sample and was applied to biological data. A MATLAB implementation of this method can be downloaded from the web link http://www.riken.jp/en/research/labs/ims/med_sci_math/.

Highlights

T HE aim of unsupervised clustering algorithms is to partition the data into clusters
We carry out analysis on artificial data as well as on biological data to evaluate the performance of hierarchical maximum likelihood (HML)
We proposed a hierarchical maximum likelihood (HML) method by considering the topologies of genomic data

Summary

Introduction

T HE aim of unsupervised clustering algorithms is to partition the data into clusters. In this case, the class label information is unknown; i.e., the knowledge regarding the state of the nature of samples is not provided and clustering is performed by taking into account a similarity or distance measure, distribution information or by some objective functions. In biological data (e.g. genomic data, transcriptomic data) the number of clusters, as well as the location of clusters, are unknown. It would be beneficial to develop a scheme that takes into account the distribution information as well

Objectives

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE transactions on bio-medical engineering	Publication Date: Mar 24, 2016
Citations: 76	License type: other-oa

R Discovery Prime

R Discovery Prime

Hierarchical Maximum Likelihood Clustering Approach.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE transactions on bio-medical engineering

Lead the way for us

Similar Papers

Stepwise iterative maximum likelihood clustering approach.
Alok Sharma ... Tatsuhiko Tsunoda
BMC Bioinformatics | VOL. 17
Alok Sharma, et. al.Alok Sharma ... Tatsuhiko Tsunoda
24 Aug 2016
BMC Bioinformatics | VOL. 17

Survey on Multi-omics, and Multi-omics Data Analysis, Integration and Application
Mohamad Hesam Shahrajabian ... Wenli Sun
Current Pharmaceutical Analysis | VOL. 19
Mohamad Hesam Shahrajabian, et. al.Mohamad Hesam Shahrajabian ... Wenli Sun
01 May 2023
Current Pharmaceutical Analysis | VOL. 19

Integrated IBD Analysis, GWAS Analysis and Transcriptome Analysis to Identify the Candidate Genes for White Spot Disease in Maize.
Dong Wang ... Yunfang Zhu
International Journal of Molecular Sciences | VOL. 24
Dong Wang, et. al.Dong Wang ... Yunfang Zhu
11 Jun 2023
International Journal of Molecular Sciences | VOL. 24

Multi-Omics Data Mining Techniques: Algorithms and Software
Min Tang ... Yi Liu
-
Min Tang, et. al.Min Tang ... Yi Liu
01 Jan 2023
01 Jan 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hierarchical Maximum Likelihood Clustering Approach.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE transactions on bio-medical engineering