Abstract

Cluster analysis is an important task in data mining, and clustering algorithms have gained much attention during the last decade. However, extracting significant features from high-dimensional data and rapidly producing satisfactory clustering results remain challenging. This paper presents a new affinity propagation (AP) clustering method based on a hybrid kernel function with locally linear embedding, called LLE-HKAP, for the classification of gene expression datasets and standard UCI datasets. First, the locally linear embedding algorithm is used to reduce the dimension of the original dataset. Then, a novel AP clustering method is proposed whose similarity measure is based on a hybrid kernel function. In this method, a new global kernel with high generalization ability is defined. A hybrid kernel function that linearly combines the proposed global kernel with the Gaussian kernel is then defined to further enhance the learning ability of the global kernel. This hybrid kernel is used to define the similarity measure and to construct the similarity matrix for AP clustering. Finally, the improved AP clustering algorithm is evaluated on eight public gene expression datasets and eight standard UCI datasets and compared with other related algorithms. The experimental results show that the proposed clustering algorithm achieves high clustering accuracy and outperforms the existing approaches with which it is compared.
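
As a rough illustration of the hybrid-kernel idea, the sketch below linearly combines a global kernel with a Gaussian kernel and turns the result into a similarity matrix of the kind AP clustering consumes. The abstract does not give the form of the proposed global kernel, the combination weight, or the exact similarity measure, so several pieces here are assumptions: a polynomial kernel stands in for the global kernel, `weight` and `sigma` are hypothetical parameters, and the similarity is taken to be the negative squared distance in the kernel-induced feature space. The locally linear embedding step is omitted, and `points` is toy data standing in for already reduced samples.

```python
import math


def gaussian_kernel(x, y, sigma=1.0):
    """Gaussian (local) kernel: exp(-||x - y||^2 / (2 * sigma^2))."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))


def stand_in_global_kernel(x, y, degree=2, coef0=1.0):
    """Polynomial kernel, used ONLY as a placeholder for the paper's
    global kernel, whose definition is not given in the abstract."""
    return (sum(a * b for a, b in zip(x, y)) + coef0) ** degree


def hybrid_kernel(x, y, weight=0.5, sigma=1.0):
    """Linear combination of the global and Gaussian kernels.
    `weight` is a hypothetical mixing coefficient in [0, 1]."""
    return (weight * stand_in_global_kernel(x, y)
            + (1.0 - weight) * gaussian_kernel(x, y, sigma))


def similarity_matrix(points, weight=0.5, sigma=1.0):
    """Assumed similarity: negative squared distance in the feature space
    induced by the hybrid kernel, d^2 = k(x,x) + k(y,y) - 2*k(x,y)."""
    def k(x, y):
        return hybrid_kernel(x, y, weight, sigma)

    n = len(points)
    return [[-(k(points[i], points[i]) + k(points[j], points[j])
               - 2.0 * k(points[i], points[j]))
             for j in range(n)]
            for i in range(n)]


# Toy data: two nearby points and one distant point.
points = [(0.0, 0.0), (0.1, 0.0), (3.0, 3.0)]
S = similarity_matrix(points)
```

In this toy example the two nearby points receive a much higher (less negative) similarity than the distant pair, which is the property AP relies on when selecting exemplars. A matrix like `S` could then be passed to an AP implementation that accepts precomputed similarities.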
