Relevance Feature Discovery for Text Mining

Vikrant Sharma

doi:10.17762/msea.v70i1.2303

Abstract

Due to large size words also data patterns, it is difficult to ensure the quality of relevant characteristics that are found in text documents that describe user preferences. Most widely used text mining and classification techniques now in use have embraced term-based strategies. However, polysemy and synonymy issues have affected them all. The theory that pattern-based approaches should outperform term-based ones in performance in expressing user preferences has been often held throughout the years, however text mining still struggles with how to employ large-scale patterns successfully. This research introduces a novel methodology for relevance feature discovery to address this hard problem. It finds higher level features in text texts that are both positive and negative patterns and uses them instead of low-level features (terms). Additionally, it organised terms into categories and updates term weights according to the patterns and specificity of those distributions. Significant tests employing this model on the datasets RCV1, TREC themes, and Reuters-21578 reveal that it performs noticeably better than both the most advanced term-based approaches and pattern-based methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Relevance Feature Discovery for Text Mining

Abstract

Talk to us

Similar Papers

More From: Mathematical Statistician and Engineering Applications

Lead the way for us

Similar Papers

Relevance Feature Discovery for Text Mining
Yuefeng Li ... Yan Shen
IEEE Transactions on Knowledge and Data Engineering | VOL. 27
Yuefeng Li, et. al.Yuefeng Li ... Yan Shen
01 Jun 2015
IEEE Transactions on Knowledge and Data Engineering | VOL. 27

Pattern document weight discovery for text classification mining
S Brindha ... S Sukumaran
-
S Brindha, et. al.S Brindha ... S Sukumaran
01 Oct 2016
01 Oct 2016

A Theoretical Study on Advanced Techniques in Pre-processing and Text Classification
...
International Journal of Data Mining And Emerging Technologies | VOL. 5
, et. al. ...
01 Jan 2015
International Journal of Data Mining And Emerging Technologies | VOL. 5

Mining positive and negative patterns for relevance feature discovery
Yuefeng Li ... Ning Zhong
-
Yuefeng Li, et. al.Yuefeng Li ... Ning Zhong
25 Jul 2010
25 Jul 2010

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Relevance Feature Discovery for Text Mining

Abstract

Talk to us

Similar Papers

More From: Mathematical Statistician and Engineering Applications