Multilingual novelty detection

Flora S Tsai,Yi Zhang,Agus T Kwee,Wenyin Tang

doi:10.1016/j.eswa.2010.07.016

Abstract

Novelty detection aims at reducing redundant information from a chronologically ordered list of documents or sentences. Other studies of novelty detection have been conducted on the English language, but few papers have addressed the problem of multilingual novelty detection. Likewise, research in multilingual information retrieval have rarely been applied to novelty detection. This paper attempts to bridge the two disciplines by first describing the preprocessing steps for English, Malay and Chinese, then applying document and sentence-level novelty detection for the three languages on APWSJ and TREC 2004 Novelty Track data. Experiments on sentence-level novelty detection show similar results for all three languages, which indicates that our algorithm is suitable for multilingual novelty detection at the sentence level. However, results for document-level novelty detection show a disparity across the different languages, with English and Malay outperforming Chinese. After applying sentence-level novelty detection to detect novel documents, we observe substantial improvements on all three languages. This demonstrates that segmenting documents into sentences improves document-level novelty detection in multiple languages, and has practical benefits for a real-time multilingual novelty detection system.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multilingual novelty detection

Abstract

Talk to us

Similar Papers

More From: Expert Systems With Applications

Lead the way for us

Journal: Expert Systems With Applications	Publication Date: Jul 16, 2010
Citations: 5

Similar Papers

Improving search effectiveness in sentence retrieval and novelty detection
Ronald T Fernández
ACM SIGIR Forum | VOL. 45
Ronald T FernándezRonald T Fernández
24 May 2011
ACM SIGIR Forum | VOL. 45

Database optimization for novelty detection
Ong Chun Lin ... Flora S Tsai
-
Ong Chun Lin, et. al.Ong Chun Lin ... Flora S Tsai
01 Dec 2009
01 Dec 2009

Anytime online novelty and change detection for mobile robots
Boris Sofman ... J Andrew Bagnell
Journal of Field Robotics | VOL. 28
Boris Sofman, et. al.Boris Sofman ... J Andrew Bagnell
21 Jun 2011
Journal of Field Robotics | VOL. 28

Sentence-Level Novelty Detection in English and Malay
Agus T Kwee ... Wenyin Tang
-
Agus T Kwee, et. al.Agus T Kwee ... Wenyin Tang
01 Jan 2009
01 Jan 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multilingual novelty detection

Abstract

Talk to us

Similar Papers

More From: Expert Systems With Applications