InfoCTM: A Mutual Information Maximization Perspective of Cross-Lingual Topic Modeling

Xiaobao Wu, Anh Tuan Luu, Chaoqun Liu, Liang-Ming Pan, Xinshuai Dong, Thong Nguyen

doi:10.1609/aaai.v37i11.26612

Abstract

Cross-lingual topic models are widely used for cross-lingual text analysis because they reveal aligned latent topics. However, most existing methods have two weaknesses: they produce repetitive topics that hinder further analysis, and their performance declines when dictionaries have low coverage. In this paper, we propose Cross-lingual Topic Modeling with Mutual Information (InfoCTM). Instead of the direct alignment used in previous work, we propose a topic alignment method based on mutual information. It acts as a regularizer that properly aligns topics and prevents degenerate topic representations of words, which mitigates the repetitive topic issue. To address the low-coverage dictionary issue, we further propose a cross-lingual vocabulary linking method that finds additional linked cross-lingual words for topic alignment beyond the translations in a given dictionary. Extensive experiments on English, Chinese, and Japanese datasets demonstrate that our method outperforms state-of-the-art baselines, producing more coherent, diverse, and well-aligned topics and showing better transferability for cross-lingual classification tasks.
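The abstract does not spell out the alignment objective. One common way to maximize mutual information between paired representations is the InfoNCE lower bound. The sketch below uses it as an assumption, not as the paper's confirmed formulation. It treats row i of each matrix as the topic representation of one word in a linked cross-lingual pair. The names `info_nce`, `phi_en`, `phi_zh`, and `temperature` are hypothetical and chosen only for illustration.

```python
import numpy as np


def info_nce(phi_a, phi_b, temperature=0.1):
    """InfoNCE loss for row-aligned pairs (an assumed objective, not the paper's code).

    phi_a[i] and phi_b[i] are topic representations of two linked words;
    every other row of phi_b serves as a negative for phi_a[i].
    """
    # Normalize rows so the dot product is a cosine similarity.
    a = phi_a / np.linalg.norm(phi_a, axis=1, keepdims=True)
    b = phi_b / np.linalg.norm(phi_b, axis=1, keepdims=True)
    logits = a @ b.T / temperature
    # Numerically stable log-softmax over each row.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # Positives sit on the diagonal; minimizing this loss raises the MI bound.
    return -np.mean(np.diag(log_prob))


# Toy data: 8 linked word pairs, 5 topics (random values, purely illustrative).
rng = np.random.default_rng(0)
phi_en = rng.random((8, 5))
phi_zh = phi_en.copy()                 # perfectly aligned representations
phi_zh_shuffled = np.roll(phi_en, 1, axis=0)  # every pair mismatched

loss_aligned = info_nce(phi_en, phi_zh)
loss_shuffled = info_nce(phi_en, phi_zh_shuffled)
print(loss_aligned < loss_shuffled)
```

In this toy setting the aligned pairs give a lower loss than the mismatched ones, which is the behavior a mutual-information regularizer of this kind relies on. Because each word is also contrasted against all non-linked words, representations that collapse to the same vector are penalized, which matches the abstract's claim about preventing degenerate topic representations.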

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Jun 26, 2023
Citations: 4
