Relevance Feature Discovery for Text Mining

Yuefeng Li,Moch Arif Bijaksana,Mubarak Albathan,Abdulmohsen Algarni,Yan Shen

doi:10.1109/tkde.2014.2373357

Abstract

It is a big challenge to guarantee the quality of discovered relevance features in text documents for describing user preferences because of large scale terms and data patterns. Most existing popular text mining and classification methods have adopted term-based approaches. However, they have all suffered from the problems of polysemy and synonymy. Over the years, there has been often held the hypothesis that pattern-based methods should perform better than term-based ones in describing user preferences; yet, how to effectively use large scale patterns remains a hard problem in text mining. To make a breakthrough in this challenging issue, this paper presents an innovative model for relevance feature discovery. It discovers both positive and negative patterns in text documents as higher level features and deploys them over low-level features (terms). It also classifies terms into categories and updates term weights based on their specificity and their distributions in patterns. Substantial experiments using this model on RCV1, TREC topics and Reuters-21578 show that the proposed model significantly outperforms both the state-of-the-art term-based methods and the pattern based methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Relevance Feature Discovery for Text Mining

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering

Lead the way for us

Journal: IEEE Transactions on Knowledge and Data Engineering	Publication Date: Jun 1, 2015
Citations: 58

Similar Papers

Pattern document weight discovery for text classification mining
S Brindha ... S Sukumaran
-
S Brindha, et. al.S Brindha ... S Sukumaran
01 Oct 2016
01 Oct 2016

Mining positive and negative patterns for relevance feature discovery
Yuefeng Li ... Ning Zhong
-
Yuefeng Li, et. al.Yuefeng Li ... Ning Zhong
25 Jul 2010
25 Jul 2010

An Advanced Fuzzy Constructing Algorithm for Feature Discovery in Text Mining
Evana Ramalakshmi ... Subhakar Golla
International Journal of Computer Applications | VOL. 127
Evana Ramalakshmi, et. al.Evana Ramalakshmi ... Subhakar Golla
15 Oct 2015
International Journal of Computer Applications | VOL. 127

Relevance Feature Discovery for Text Mining
Vikrant Sharma
Mathematical Statistician and Engineering Applications | VOL. 70
Vikrant SharmaVikrant Sharma
31 Jan 2021
Mathematical Statistician and Engineering Applications | VOL. 70

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Relevance Feature Discovery for Text Mining

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering