Abstract

At present, cluster analysis has become a very good channel for analyzing gene expression data to obtain biological information. In recent years, many experts have used traditional clustering algorithms and new clustering algorithms to mine gene expression data. This article first introduces the preprocessing of gene expression data. Then, by using principal component analysis (PCA) to process the gene data, a small number of characteristic variables are extracted as new indicators, and the indicators are evaluated to achieve the purpose of dimensionality reduction. The dimension reduction index is applied to the dynamic self-organizing neural network (DSOM) neural network, and the victory neurons are selected by the minimum Euclidean distance. The characteristics of the genetic data are clustered through the DSOM network, and the gene types with similar characteristics are divided into one region. The results show that PCA and DSOM networks have a high accuracy rate for clustering of genetic data, and a clear division of boundaries.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.