Abstract

Day by day the number of text documents in digital form is increasing. Text classification is used to organize these text documents. However, text classification has the problem of high dimensionality of feature space. This high dimensionality of feature space is solved by feature selection and feature extraction methods and improves the performance of text categorization. The feature selection and feature extraction techniques remove the irrelevant features from the text documents and reduce the dimensionality of feature space. This paper presents the various feature selection and feature extraction methods. This paper also discusses various classifiers to classify the documents.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call