TOFA: Trace Oriented Feature Analysis in Text Categorization

Jun Yan,Weiguo Fan,Qiang Yang,Ning Liu,Zheng Chen

doi:10.1109/icdm.2008.67

Abstract

Dimension reduction for large-scale text data is attracting much attention lately due to the rapid growth of World Wide Web. We can consider dimension reduction algorithms in two categories: feature extraction and feature selection. An important problem remains: it has been difficult to integrate these two algorithm categories into a single framework, making it difficult to reap the benefit of both. In this paper, we formulate the two algorithm categories through a unified optimization framework. Under this framework, we develop a novel feature selection algorithm called Trace Oriented Feature Analysis (TOFA). The novel objective function of TOFA is a unified framework that integrates many prominent feature extraction algorithms such as unsupervised Principal Component Analysis and supervised Maximum Margin Criterion are special cases of it. Thus TOFA can process not only supervised problem but also unsupervised and semi-supervised problems. Experimental results on real text datasets demonstrate the effectiveness and efficiency of TOFA.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

TOFA: Trace Oriented Feature Analysis in Text Categorization

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Trace-Oriented Feature Analysis for Large-Scale Text Data Dimension Reduction
Jun Yan ... Zheng Chen
IEEE Transactions on Knowledge and Data Engineering | VOL. 23
Jun Yan, et. al.Jun Yan ... Zheng Chen
01 Jul 2011
IEEE Transactions on Knowledge and Data Engineering | VOL. 23

Robust Dimensionality Reduction via Low-rank Laplacian Graph Learning
Mingjian Cai ... Sirui Tian
ACM Transactions on Intelligent Systems and Technology | VOL. 14
Mingjian Cai, et. al.Mingjian Cai ... Sirui Tian
01 Apr 2023
ACM Transactions on Intelligent Systems and Technology | VOL. 14

Dimensionality and data reduction in telecom churn prediction
Wei-Chao Lin ... Chih-Fong Tsai
Kybernetes | VOL. 43
Wei-Chao Lin, et. al.Wei-Chao Lin ... Chih-Fong Tsai
29 Apr 2014
Kybernetes | VOL. 43

Locally alignment based manifold learning for simultaneous feature selection and extraction in classification problems
Mahboubeh Fattahi ... Yahya Forghani
Knowledge-Based Systems | VOL. 259
Mahboubeh Fattahi, et. al.Mahboubeh Fattahi ... Yahya Forghani
05 Nov 2022
Knowledge-Based Systems | VOL. 259

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

TOFA: Trace Oriented Feature Analysis in Text Categorization

Abstract

Talk to us

Similar Papers