Application of Logistic Regression with part-of-the-speech tagging for multi-class text classification

Tomas Pranckevicius, Virginijus Marcinkevicius

doi:10.1109/aieee.2016.7821805

Publication Date: Nov 1, 2016

Citations: 36

#Evaluating Classification Accuracy #Number Of N-grams

Abstract

Today's computing environments make it possible to carry out a variety of data-intensive natural language processing tasks. Language tokenization methods for multi-class text classification have recently been investigated by many data scientists. The authors of this paper investigate the Logistic Regression method by evaluating how classification accuracy depends on the size of the training data, part-of-speech (POS) tagging, and the number of n-grams. The Logistic Regression method is implemented in Apache Spark, an in-memory computing platform. Experimental results show that the multi-class classification method applied to Amazon product-review data achieves higher classification accuracy when POS features are used.
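
The combination described in the abstract (n-gram features, POS tags, and a multi-class logistic regression) can be illustrated with a minimal plain-Python sketch. It is not the authors' Apache Spark implementation: the function names, the toy reviews, the hand-assigned POS tags, the three class labels, and the choice to attach each tag to its word as `word_TAG` are all assumptions made for illustration only.

```python
import math
from collections import Counter

def ngrams(tokens, n_max):
    """Return every contiguous n-gram of length 1..n_max as a space-joined string."""
    return [" ".join(tokens[i:i + n])
            for n in range(1, n_max + 1)
            for i in range(len(tokens) - n + 1)]

def featurize(tagged, n_max=2, use_pos=True):
    """Map [(word, tag), ...] to n-gram counts.
    Assumption: POS information is attached to each token as word_TAG."""
    tokens = [f"{w.lower()}_{t}" if use_pos else w.lower() for w, t in tagged]
    return Counter(ngrams(tokens, n_max))

def predict_proba(weights, bias, classes, feats):
    """Softmax over one linear score per class."""
    scores = [bias[c] + sum(weights[c].get(f, 0.0) * v for f, v in feats.items())
              for c in classes]
    top = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def train(samples, labels, classes, epochs=200, lr=0.5):
    """Multinomial logistic regression fitted by stochastic gradient descent."""
    weights = {c: {} for c in classes}  # sparse weight vector per class
    bias = {c: 0.0 for c in classes}
    for _ in range(epochs):
        for feats, y in zip(samples, labels):
            probs = predict_proba(weights, bias, classes, feats)
            for c, p in zip(classes, probs):
                grad = p - (1.0 if c == y else 0.0)  # d(cross-entropy)/d(score_c)
                bias[c] -= lr * grad
                for f, v in feats.items():
                    weights[c][f] = weights[c].get(f, 0.0) - lr * grad * v
    return weights, bias

def predict(weights, bias, classes, feats):
    """Return the class with the highest predicted probability."""
    probs = predict_proba(weights, bias, classes, feats)
    return classes[probs.index(max(probs))]

# Hypothetical, hand-tagged toy reviews (not taken from the Amazon data set).
reviews = [
    ([("great", "JJ"), ("phone", "NN")], "positive"),
    ([("terrible", "JJ"), ("battery", "NN")], "negative"),
    ([("works", "VBZ"), ("as", "IN"), ("expected", "VBN")], "neutral"),
    ([("love", "VB"), ("this", "DT"), ("camera", "NN")], "positive"),
    ([("broke", "VBD"), ("after", "IN"), ("a", "DT"), ("week", "NN")], "negative"),
    ([("average", "JJ"), ("quality", "NN")], "neutral"),
]
classes = ["negative", "neutral", "positive"]
samples = [featurize(tagged, n_max=2, use_pos=True) for tagged, _ in reviews]
labels = [label for _, label in reviews]

weights, bias = train(samples, labels, classes)
predictions = [predict(weights, bias, classes, s) for s in samples]
```

Changing `n_max` or `use_pos` in `featurize` alters the feature space along two of the dimensions the abstract names (number of n-grams and POS); the third, training-set size, would be varied by changing how many reviews are passed to `train`. A toy set this small only shows the mechanics and says nothing about accuracy.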
