Extraction of Interlingual Documents Clusters Based on Closed Concepts Mining

Mohamed Chebel,Eric Gaussier,Chiraz Latiri

doi:10.1016/j.procs.2015.08.176

Abstract

Abstract To address multilingual document classification in an effcient and effective manner, we claim that a synergy between classical IR techniques such as vector model and some advanced data mining methods, especially Formal Concept Analysis, is particularly appropriate. We propose in this paper, a new statistical approach for extracting inter-language clusters from multilingual documents based on Closed Concepts Mining and vector model. Formal Concept Analysis techniques are applied to extract Closed Concepts from comparable corpora; and, then, exploit these Closed Concepts and vector models in the clustering and alignment of multilin- gual documents. An experimental evaluation is conducted on the collection of bilingual documents French-English of CLEF’2003. The results confirmed that the synergy between Formal Concept Analysis and vector model is fruitful to extract bilingual classes of documents, with an interesting comparability score.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Procedia Computer Science	Publication Date: Jan 1, 2015
Citations: 5	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

Extraction of Interlingual Documents Clusters Based on Closed Concepts Mining

Abstract

Talk to us

Similar Papers

More From: Procedia Computer Science

Lead the way for us

Similar Papers

A New Case-Based Classifier System Using Rough Formal Concept Analysis
Puntip Pattaraintakorn ... Jirapond Tadrat
-
Puntip Pattaraintakorn, et. al.Puntip Pattaraintakorn ... Jirapond Tadrat
01 Nov 2008
01 Nov 2008

Using Formal Concept Analysis for Mining and Interpreting Patient Flows within a Healthcare Network
Nicolas Jay ... François Kohler
-
Nicolas Jay, et. al.Nicolas Jay ... François Kohler
25 Oct 2006
25 Oct 2006

SE‐FCA: A Model of Software Evolution with Formal Concept Analysis
Xiaobing Sun ... Ying Chen
Chinese Journal of Electronics | VOL. 24
Xiaobing Sun, et. al.Xiaobing Sun ... Ying Chen
01 Jan 2015
Chinese Journal of Electronics | VOL. 24

FcaR, Formal Concept Analysis with R
Pablo Cordero ... Manuel Enciso
The R Journal | VOL. 14
Pablo Cordero, et. al.Pablo Cordero ... Manuel Enciso
21 Jun 2022
The R Journal | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Extraction of Interlingual Documents Clusters Based on Closed Concepts Mining

Abstract

Talk to us

Similar Papers

More From: Procedia Computer Science