情報管理入門

Pei Yang ,Wei Gao ,Qi Tan ,Kam-Fai Wong

doi:10.1241/johokanri.27.412

Abstract

Transfer learning utilizes labeled data available from some related domain (source domain) for achieving effective knowledge transformation to the target domain. However, most state-of-the-art cross-domain classification methods treat documents as plain text and ignore the hyperlink (or citation) relationship existing among the documents. In this paper, we propose a novel cross-domain document classification approach called Link-Bridged Topic model (LBT). LBT consists of two key steps. Firstly, LBT utilizes an auxiliary link net- work to discover the direct or indirect co-citation relationship among documents by embedding the background knowledge into a graph kernel. The mined co-citation relation- ship is leveraged to bridge the gap across different domains. Secondly, LBT simultaneously combines the content information and link structures into a unified latent topic model. The model is based on an assumption that the documents of source and target domains share some common topics from the point of view of both content information and link struc- ture. By mapping both domains data into the latent topic spaces, LBT encodes the knowl- edge about domain commonality and difference as the shared topics with associated differential probabilities. The learned latent topics must be consistent with the source and target data, as well as content and link statistics. Then the shared topics act as the bridge to facilitate knowledge transfer from the source to the target domains. Experiments on different types of datasets show that our algorithm significantly improves the general- ization performance of cross-domain document classification.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

情報管理入門

Abstract

Talk to us

Similar Papers

More From: Journal of Information Processing and Management

Lead the way for us

Journal: Journal of Information Processing and Management	Publication Date: Jan 1, 1984
License type: free

Similar Papers

A link-bridged topic model for cross-domain document classification
Pei Yang ... Kam-Fai Wong
Information Processing & Management | VOL. 49
Pei Yang, et. al.Pei Yang ... Kam-Fai Wong
22 Jun 2013
Information Processing & Management | VOL. 49

Information Processing and Management
P M Thankachan ... R Vijayakumar
-
P M Thankachan, et. al.P M Thankachan ... R Vijayakumar
01 Jan 2009
01 Jan 2009

Learning Transferable Convolutional Proxy by SMI-Based Matching Technique
Wei Jin ... Nan Jia
Shock and Vibration | VOL. 2020
Wei Jin, et. al.Wei Jin ... Nan Jia
14 Oct 2020
Shock and Vibration | VOL. 2020

Overcoming learning bias via Prototypical Feature Compensation for source-free domain adaptation
Zicheng Pan ... Yongsheng Gao
Pattern Recognition | VOL. 158
Zicheng Pan, et. al.Zicheng Pan ... Yongsheng Gao
17 Sep 2024
Pattern Recognition | VOL. 158

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

情報管理入門

Abstract

Talk to us

Similar Papers

More From: Journal of Information Processing and Management