A study of user profile representation for personalized cross-language information retrieval

Dong Zhou,Wenyu Zhao,Xuan Wu,Séamus Lawless,Jianxun Liu

doi:10.1108/ajim-06-2015-0091

Abstract

Purpose– With an increase in the amount of multilingual content on the World Wide Web, users are often striving to access information provided in a language of which they are non-native speakers. The purpose of this paper is to present a comprehensive study of user profile representation techniques and investigate their use in personalized cross-language information retrieval (CLIR) systems through the means of personalized query expansion.Design/methodology/approach– The user profiles consist of weighted terms computed by using frequency-based methods such as tf-idf and BM25, as well as various latent semantic models trained on monolingual documents and cross-lingual comparable documents. This paper also proposes an automatic evaluation method for comparing various user profile generation techniques and query expansion methods.Findings– Experimental results suggest that latent semantic-weighted user profile representation techniques are superior to frequency-based methods, and are particularly suitable for users with a sufficient amount of historical data. The study also confirmed that user profiles represented by latent semantic models trained on a cross-lingual level gained better performance than the models trained on a monolingual level.Originality/value– Previous studies on personalized information retrieval systems have primarily investigated user profiles and personalization strategies on a monolingual level. The effect of utilizing such monolingual profiles for personalized CLIR remains unclear. The current study fills the gap by a comprehensive study of user profile representation for personalized CLIR and a novel personalized CLIR evaluation methodology to ensure repeatable and controlled experiments can be conducted.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A study of user profile representation for personalized cross-language information retrieval

Abstract

Talk to us

Similar Papers

More From: Aslib Journal of Information Management

Lead the way for us

Journal: Aslib Journal of Information Management	Publication Date: Jul 18, 2016
Citations: 10

Similar Papers

Speech and text query based Tamil - English Cross Language Information Retrieval system
P Iswarya ... V Radha
-
P Iswarya, et. al.P Iswarya ... V Radha
01 Jan 2014
01 Jan 2014

A comprehensive survey on cross-language information retrieval system
Gouranga Charan Jena ... Siddharth Swarup Rautaray
Indonesian Journal of Electrical Engineering and Computer Science | VOL. 14
Gouranga Charan Jena, et. al.Gouranga Charan Jena ... Siddharth Swarup Rautaray
01 Apr 2019
Indonesian Journal of Electrical Engineering and Computer Science | VOL. 14

A Proposal to Study of Cross Language Information Retrieval (CLIR) System Users’ Information Seeking Behavior
Yoojin Ha
-
Yoojin HaYoojin Ha
01 Jan 2014
01 Jan 2014

A Proposal to Study of Cross Language Information Retrieval (CLIR) System Users' Information Seeking Behavior
Yoojin Ha
-
Yoojin HaYoojin Ha
01 Jan 2018
01 Jan 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A study of user profile representation for personalized cross-language information retrieval

Abstract

Talk to us

Similar Papers

More From: Aslib Journal of Information Management