Semi-supervised deep embedded clustering with pairwise constraints and subset allocation

Yalin Wang,Jiangfeng Zou,Kai Wang,Chenliang Liu,Xiaofeng Yuan

doi:10.1016/j.neunet.2023.04.016

Abstract

Semi-supervised deep clustering methods attract much attention due to their excellent performance on the end-to-end clustering task. However, it is hard to obtain satisfying clustering results since many overlapping samples in industrial text datasets strongly and incorrectly influence the learning process. Existing methods incorporate prior knowledge in the form of pairwise constraints or class labels, which not only largely ignore the correlation between these two supervision information but also cause the problem of weak-supervised constraint or incorrect strong-supervised label guidance. In order to tackle these problems, we propose a semi-supervised method based on pairwise constraints and subset allocation (PCSA-DEC). We redefine the similarity-based constraint loss by forcing the similarity of samples in the same class much higher than other samples and design a novel subset allocation loss to precisely learn strong-supervised information contained in labels which consistent with unlabeled data. Experimental results on the two industrial text datasets show that our method can yield 8.2%–8.7% improvement in accuracy and 13.4%–19.8% on normalized mutual information over the state-of-the-art method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Semi-supervised deep embedded clustering with pairwise constraints and subset allocation

Abstract

Talk to us

Similar Papers

More From: Neural Networks

Lead the way for us

Journal: Neural Networks	Publication Date: Apr 20, 2023
Citations: 4

Similar Papers

Semi-supervised Clustering Using Bayesian Regularization
Zuobing Xu ... Ram Akella
-
Zuobing Xu, et. al.Zuobing Xu ... Ram Akella
01 Oct 2007
01 Oct 2007

Active deep image clustering
Bicheng Sun ... Xuejun Li
Knowledge-Based Systems | VOL. 252
Bicheng Sun, et. al.Bicheng Sun ... Xuejun Li
01 Jul 2022
Knowledge-Based Systems | VOL. 252

Semi-supervised discriminative clustering with graph regularization
Marek Śmieja ... Jacek Tabor
Knowledge-Based Systems | VOL. 151
Marek Śmieja, et. al.Marek Śmieja ... Jacek Tabor
12 Mar 2018
Knowledge-Based Systems | VOL. 151

MSC-CSMC: A multi-objective semi-supervised clustering algorithm based on constraints selection and multi-source constraints for gene expression data.
Zeyuan Wang ... Hong Gu
Frontiers in genetics | VOL. 14
Zeyuan Wang, et. al.Zeyuan Wang ... Hong Gu
27 Feb 2023
Frontiers in genetics | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Semi-supervised deep embedded clustering with pairwise constraints and subset allocation

Abstract

Talk to us

Similar Papers

More From: Neural Networks