Application of Fuzzy c-Means Clustering in Data Analysis of Metabolomics

Xiang Li,Hongwei Kong,Jing Tian,Peng Gao,Xin Lu,Guowang Xu

doi:10.1021/ac900353t

Abstract

Fuzzy c-means (FCM) clustering is an unsupervised method derived from fuzzy logic that is suitable for solving multiclass and ambiguous clustering problems. In this study, FCM clustering is applied to cluster metabolomics data. FCM is performed directly on the data matrix to generate a membership matrix which represents the degree of association the samples have with each cluster. The method is parametrized with the number of clusters (C) and the fuzziness coefficient (m), which denotes the degree of fuzziness in the algorithm. Both have been optimized by combining FCM with partial least-squares (PLS) using the membership matrix as the Y matrix in the PLS model. The quality parameters R(2)Y and Q(2) of the PLS model have been used to monitor and optimize C and m. Data of metabolic profiles from three gene types of Escherichia coli were used to demonstrate the method above. Different multivariable analysis methods have been compared. Principal component analysis failed to model the metabolite data, while partial least-squares discriminant analysis yielded results with overfitting. On the basis of the optimized parameters, the FCM was able to reveal main phenotype changes and individual characters of three gene types of E. coli. Coupled with PLS, FCM provides a powerful research tool for metabolomics with improved visualization, accurate classification, and outlier estimation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Application of Fuzzy c-Means Clustering in Data Analysis of Metabolomics

Abstract

Talk to us

Similar Papers

More From: Analytical Chemistry

Lead the way for us

Journal: Analytical Chemistry	Publication Date: May 1, 2009
Citations: 75

Similar Papers

FCM 클러스터링 알고리즘과 퍼지 결정트리를 이용한 상황인식 정보 서비스
Seokhwan Yang ... Mokdong Chung
Journal of Korea Multimedia Society | VOL. 16
Seokhwan Yang, et. al.Seokhwan Yang ... Mokdong Chung
31 Jul 2013
Journal of Korea Multimedia Society | VOL. 16

Automatic structure identification of TSK fuzzy model for stock index forecasting
Jungwon Yu ... Sungshin Kim
-
Jungwon Yu, et. al.Jungwon Yu ... Sungshin Kim
01 Aug 2015
01 Aug 2015

Local segmentation of images using an improved fuzzy C-means clustering algorithm based on self-adaptive dictionary learning
Jiaqing Miao ... Ting-Zhu Huang
Applied Soft Computing | VOL. 91
Jiaqing Miao, et. al.Jiaqing Miao ... Ting-Zhu Huang
03 Mar 2020
Applied Soft Computing | VOL. 91

정보 입자화를 통한 방사형 기저 함수 기반 다항식 신경 회로망의 진화론적 설계
...
The Transactions of The Korean Institute of Electrical Engineers | VOL. 60
, et. al. ...
01 Apr 2011
The Transactions of The Korean Institute of Electrical Engineers | VOL. 60

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Application of Fuzzy c-Means Clustering in Data Analysis of Metabolomics

Abstract

Talk to us

Similar Papers

More From: Analytical Chemistry