Abstract

已有的聚类集算法基本上都是非监督聚类集成算法,这样不能利用已知信息,使得聚类集成的准确性、鲁棒性和稳定性降低。把半监督学习和聚类集成结合起来,设计半监督聚类集成模型来克服这些缺点。主要工作包括:第一,设计了基于贝叶斯网络的半监督聚类集成(semi-supervised cluster ensemble,简称SCE)模型,并对模型用变分法进行了推理求解;第二,在此基础上,给出了EM(expectation maximization)框架下的具体算法;第三,从UCI(University of Califor;The existing algorithms are mostly unsupervised algorithms of a cluster ensemble, which cannot take advantages of known information of datasets. As a result, the precision, robustness, and stability of a cluster ensemble are degraded. To conquer these disadvantages, a semi-supervised cluster ensemble (SCE) model, based on both semi-supervised learning and ensemble learning technologies, is designed in this paper. There are three main works in this paper. The first is that SCE is proposed, and the variational inference oriented SCE is illustrated in detail. The second is based on the above work: an EM (expectation maximization) algorithm of SCE is illustrated step by step. The third is that some datasets are drawn from the UCI (University of California, Irvine) machine learning database for experiments which show that both SCE and its EM algorithm are good for semi-supervised cluster ensemble and outperforms NMFS (algorithm of nonnegative-matrix-factorization based semi-supervised), semi-supervised SVM (support vector machine), and LVCE (latent variable model for cluster ensemble). The Semi-Supervised Cluster Ensemble model is first stated in this paper, and this paper includes the advantages of both the semi-supervise learning and the cluster ensemble. Therefore, its result is better than the results of semi-learning clustering and cluster ensemble.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.