Adaptive Concept Resolution for document representation and its applications in text mining

Lidong Bing,Shan Jiang,Wai Lam,Yan Zhang,Shoaib Jameel

doi:10.1016/j.knosys.2014.10.003

Abstract

It is well-known that synonymous and polysemous terms often bring in some noise when we calculate the similarity between documents. Existing ontology-based document representation methods are static so that the selected semantic concepts for representing a document have a fixed resolution. Therefore, they are not adaptable to the characteristics of document collection and the text mining problem in hand. We propose an Adaptive Concept Resolution (ACR) model to overcome this problem. ACR can learn a concept border from an ontology taking into the consideration of the characteristics of the particular document collection. Then, this border provides a tailor-made semantic concept representation for a document coming from the same domain. Another advantage of ACR is that it is applicable in both classification task where the groups are given in the training document set and clustering task where no group information is available. The experimental results show that ACR outperforms an existing static method in almost all cases. We also present a method to integrate Wikipedia entities into an expert-edited ontology, namely WordNet, to generate an enhanced ontology named WordNet-Plus, and its performance is also examined under the ACR model. Due to the high coverage, WordNet-Plus can outperform WordNet on data sets having more fresh documents in classification.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Adaptive Concept Resolution for document representation and its applications in text mining

Abstract

Talk to us

Similar Papers

More From: Knowledge-Based Systems

Lead the way for us

Journal: Knowledge-Based Systems	Publication Date: Nov 1, 2014
Citations: 63

Similar Papers

Learning ontology resolution for document representation and its applications in text mining
Lidong Bing ... Wai Lam
-
Lidong Bing, et. al.Lidong Bing ... Wai Lam
26 Oct 2010
26 Oct 2010

Text Mining for Supply Chain Risk Management in the Apparel Industry
Sayed Mehdi Shah ... Michael Freitag
Applied Sciences | VOL. 11
Sayed Mehdi Shah, et. al.Sayed Mehdi Shah ... Michael Freitag
05 Mar 2021
Applied Sciences | VOL. 11

An improved Urdu stemming algorithm for text mining based on multi-step hybrid approach
Abdul Jabbar ... Qaisar Abbas
Journal of Experimental & Theoretical Artificial Intelligence | VOL. 30
Abdul Jabbar, et. al.Abdul Jabbar ... Qaisar Abbas
22 May 2018
Journal of Experimental & Theoretical Artificial Intelligence | VOL. 30

A Review of Towered Big-Data Service Model for Biomedical Text-Mining Databases
Alshreef Abed ... Lin Li
International Journal of Advanced Computer Science and Applications | VOL. 8
Alshreef Abed, et. al.Alshreef Abed ... Lin Li
01 Jan 2017
International Journal of Advanced Computer Science and Applications | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Adaptive Concept Resolution for document representation and its applications in text mining

Abstract

Talk to us

Similar Papers

More From: Knowledge-Based Systems