Multi-Level Cross-Modal Alignment for Image Clustering

Liping Qiu,Qin Zhang,Xiaojun Chen,Shaotian Cai

doi:10.1609/aaai.v38i13.29387

Abstract

Recently, the cross-modal pretraining model has been employed to produce meaningful pseudo-labels to supervise the training of an image clustering model. However, numerous erroneous alignments in a cross-modal pretraining model could produce poor-quality pseudo labels and degrade clustering performance. To solve the aforementioned issue, we propose a novel Multi-level Cross-modal Alignment method to improve the alignments in a cross-modal pretraining model for downstream tasks, by building a smaller but better semantic space and aligning the images and texts in three levels, i.e., instance-level, prototype-level, and semantic-level. Theoretical results show that our proposed method converges, and suggests effective means to reduce the expected clustering risk of our method. Experimental results on five benchmark datasets clearly show the superiority of our new method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multi-Level Cross-Modal Alignment for Image Clustering

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Similar Papers

Improving Pseudo Labels with Global-Local Denoising Framework for Cross-lingual Named Entity Recognition
Zhuojun Ding ... Xiaoye Qu
-
Zhuojun Ding, et. al.Zhuojun Ding ... Xiaoye Qu
01 Aug 2024
01 Aug 2024

Bearing defect diagnosis based on semi-supervised kernel Local Fisher Discriminant Analysis using pseudo labels
Xinmin Tao ... Junrong Zou
ISA Transactions | VOL. 110
Xinmin Tao, et. al.Xinmin Tao ... Junrong Zou
13 Oct 2020
ISA Transactions | VOL. 110

ST3D++: Denoised Self-Training for Unsupervised Domain Adaptation on 3D Object Detection.
Jihan Yang ... Zhe Wang
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45
Jihan Yang, et. al.Jihan Yang ... Zhe Wang
01 Jan 2021
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45

Weighted Pseudo Labeled Data and Mutual Learning for Semi-Supervised Classification
Jianwen Mo ... Hua Yuan
IEEE Access | VOL. 9
Jianwen Mo, et. al.Jianwen Mo ... Hua Yuan
01 Jan 2020
IEEE Access | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-Level Cross-Modal Alignment for Image Clustering

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence