Learning ontology resolution for document representation and its applications in text mining

Lidong Bing,Shan Jiang,Bai Sun,Yan Zhang,Wai Lam

doi:10.1145/1871437.1871711

Abstract

It is well known that synonymous and polysemous terms often bring in some noises when calculating the similarity between documents. Existing ontology-based document representation methods are static, hence, the chosen semantic concept set for representing a document has a fixed resolution and it is not adaptable to the characteristics of a document collection and the text mining problem in hand. We propose an Adaptive Concept Resolution (ACR) model to overcome this issue. ACR can learn a concept border from an ontology taking into consideration of the characteristics of a particular document collection. Then this border can provide a tailor-made semantic concept representation for a document coming from the same domain. Another advantage of ACR is that it is applicable in both classification task where the groups are given in the training document set, and clustering task where no group information is available. Furthermore, the result of this model is not sensitive to the model parameter. The experimental results show that ACR outperforms an existing static method significantly.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning ontology resolution for document representation and its applications in text mining

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Adaptive Concept Resolution for document representation and its applications in text mining
Lidong Bing ... Shoaib Jameel
Knowledge-Based Systems | VOL. 74
Lidong Bing, et. al.Lidong Bing ... Shoaib Jameel
01 Nov 2014
Knowledge-Based Systems | VOL. 74

Text Mining for Supply Chain Risk Management in the Apparel Industry
Sayed Mehdi Shah ... Michael Freitag
Applied Sciences | VOL. 11
Sayed Mehdi Shah, et. al.Sayed Mehdi Shah ... Michael Freitag
05 Mar 2021
Applied Sciences | VOL. 11

An accuracy-enhanced light stemmer for arabic text
Samhaa R El-Beltagy ... Ahmed Rafea
ACM Transactions on Speech and Language Processing | VOL. 7
Samhaa R El-Beltagy, et. al.Samhaa R El-Beltagy ... Ahmed Rafea
24 Feb 2010
ACM Transactions on Speech and Language Processing | VOL. 7

An improved Urdu stemming algorithm for text mining based on multi-step hybrid approach
Abdul Jabbar ... Qaisar Abbas
Journal of Experimental & Theoretical Artificial Intelligence | VOL. 30
Abdul Jabbar, et. al.Abdul Jabbar ... Qaisar Abbas
22 May 2018
Journal of Experimental & Theoretical Artificial Intelligence | VOL. 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning ontology resolution for document representation and its applications in text mining

Abstract

Talk to us

Similar Papers