Text Classification of Cornell Movie Data using Data Mining with Feature Selection

A K Shrivas, Amit Kumar Dewangan, S M Ghosh

doi:10.35940/ijeat.b2329.129219

Abstract

Text classification is a branch of text mining through which we can analyze the sentiment of movie data. In this research paper, we applied different preprocessing techniques to reduce the number of features in the Cornell movie data set. We also applied correlation-based feature subset selection and chi-square feature selection to gather the most valuable words of each category. A new Cornell movie data set was formed after applying the preprocessing steps and feature selection techniques. We classified the Cornell movie data as positive or negative using several classifiers: Support Vector Machine (SVM), Multilayer Perceptron (MLP), Naive Bayes (NB), Bayes Net (BN), and Random Forest (RF). We also compared classification accuracy across the classifiers and achieved the best accuracy, 87%, with the SVM classifier using a reduced number of features. The suggested classifier can be useful for opinion analysis of movie reviews, blogs, and other documents.
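The abstract names chi-square feature selection but gives no implementation details, so the following is only a minimal sketch of the standard chi-square statistic for term presence versus class label. It is not the authors' code. The function names `chi_square_scores` and `select_top_k` and the four toy documents are hypothetical and are not taken from the Cornell movie data set.

```python
from collections import Counter

def chi_square_scores(docs, labels):
    """Return {term: chi-square statistic} for term presence vs. class label."""
    n = len(docs)
    classes = sorted(set(labels))
    # Binary presence: each document is reduced to its set of lowercase tokens.
    tokenized = [set(d.lower().split()) for d in docs]
    vocab = set().union(*tokenized)
    class_counts = Counter(labels)
    scores = {}
    for term in vocab:
        present_total = sum(1 for toks in tokenized if term in toks)
        score = 0.0
        # Sum (observed - expected)^2 / expected over the 2 x |classes| table.
        for c in classes:
            for present in (True, False):
                observed = sum(
                    1 for toks, y in zip(tokenized, labels)
                    if (term in toks) == present and y == c
                )
                row_total = present_total if present else n - present_total
                expected = row_total * class_counts[c] / n
                if expected > 0:
                    score += (observed - expected) ** 2 / expected
        scores[term] = score
    return scores

def select_top_k(scores, k):
    """Keep the k highest-scoring terms; ties are broken alphabetically."""
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]

# Hypothetical toy corpus, for illustration only.
docs = ["great moving film", "great acting", "boring film", "boring plot"]
labels = ["pos", "pos", "neg", "neg"]

scores = chi_square_scores(docs, labels)
top_terms = select_top_k(scores, 2)
print(top_terms)  # ['boring', 'great']
```

In this toy corpus, "great" and "boring" each appear in only one class and receive the highest score, while "film" appears equally in both classes and scores zero. The selected terms would then serve as the reduced feature set passed to a classifier such as an SVM.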

More From: International Journal of Engineering and Advanced Technology
