Fuzzy C means Based Evaluation Algorithms For Cancer Gene Expression Data Clustering

Omar Al-Janabee,Basad Al-Sarray

doi:10.52866/ijcsm.2022.02.01.004

Omar Al-Janabee, Basad Al-Sarray

Open Access

https://doi.org/10.52866/ijcsm.2022.02.01.004

Copy DOI

Abstract

The influx of data in bioinformatics is primarily in the form of DNA, RNA, and protein sequences. This condition places a significant burden on scientists and computers. Some genomics studies depend on clustering techniques to group similarly expressed genes into one cluster. Clustering is a type of unsupervised learning that can be used to divide unknown cluster data into clusters. The k-means and fuzzy c-means (FCM) algorithms are examples of algorithms that can be used for clustering. Consequently, clustering is a common approach that divides an input space into several homogeneous zones; it can be achieved using a variety of algorithms. This study used three models to cluster a brain tumor dataset. The first model uses FCM, which is used to cluster genes. FCM allows an object to belong to two or more clusters with a membership grade between zero and one and the sum of belonging to all clusters of each gene is equal to one. This paradigm is useful when dealing with microarray data. The total time required to implement the first model is 22.2589 s. The second model combines FCM and particle swarm optimization (PSO) to obtain better results. The hybrid algorithm, i.e., FCM–PSO, uses the DB index as objective function. The experimental results show that the proposed hybrid FCM–PSO method is effective. The total time of implementation of this model is 89.6087 s. The third model combines FCM with a genetic algorithm (GA) to obtain better results. This hybrid algorithm also uses the DB index as objective function. The experimental results show that the proposed hybrid FCM–GA method is effective. Its total time of implementation is 50.8021 s. In addition, this study uses cluster validity indexes to determine the best partitioning for the underlying data. Internal validity indexes include the Jaccard, Davies Bouldin, Dunn, Xie–Beni, and silhouette. Meanwhile, external validity indexes include Minkowski, adjusted Rand, and percentage of correctly categorized pairings. Experiments conducted on brain tumor gene expression data demonstrate that the techniques used in this study outperform traditional models in terms of stability and biological significance.

Highlights

The k-means and fuzzy c-means (FCM) algorithms are examples of algorithms that can be used for clustering
The experimental results show that the proposed hybrid FCM–particle swarm optimization (PSO) method is effective
The experimental results show that the proposed hybrid FCM–genetic algorithm (GA) method is effective

Summary

Introduction

Recent technological developments have generated massive amounts of data in various fields, including biological science fields, to the extent that analyzing data and extracting results from them have become impossible. This situation is complemented by considerable developments in the field of informatics, which have contributed to the merging of most technologies to analyze big data and provide many systematic solutions, leading to a scientific revolution in many scientific and industrial sectors. Bioinformatics is the extension of information technology to molecular biology It is useful in comparing genes and sequences in different proteins within an organism or between organisms to identify evolutionary relationships between organisms.

Methods

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Iraqi Journal for Computer Science and Mathematics	Publication Date: Feb 21, 2022
Citations: 1	License type: cc-by

R Discovery Prime

R Discovery Prime

Fuzzy C means Based Evaluation Algorithms For Cancer Gene Expression Data Clustering

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Iraqi Journal for Computer Science and Mathematics

Lead the way for us

Similar Papers

Gene selection and dynamic neutrosophic cognitive map with bat algorithm (DNCM-BA) for diagnose of rheumatoid arthritis (RAs)
B Chithra ... R Nedunchezhian
International Journal of Engineering & Technology | VOL. 7
B Chithra, et. al.B Chithra ... R Nedunchezhian
20 Apr 2018
International Journal of Engineering & Technology | VOL. 7

Modeling and identification of irrigation station using fuzzy c-mean clustering algorithms based on particle swarm optimization
Jaouher Chrouta ... Mohamed Jemli
-
Jaouher Chrouta, et. al.Jaouher Chrouta ... Mohamed Jemli
01 Apr 2015
01 Apr 2015

Semi-hard c-means clustering with application to classifier design
H Ichihashi ... A Notsu
-
H Ichihashi, et. al.H Ichihashi ... A Notsu
01 Jul 2010
01 Jul 2010

Evaluate clustering performance and computational efficiency for PSO based fuzzy clustering methods in processing big imbalanced data
Jin Wang ... Bo Li
-
Jin Wang, et. al.Jin Wang ... Bo Li
01 May 2017
01 May 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fuzzy C means Based Evaluation Algorithms For Cancer Gene Expression Data Clustering

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Iraqi Journal for Computer Science and Mathematics