MDLText: An efficient and lightweight text classifier

Renato M Silva,Tiago A Almeida,Akebo Yamakami

doi:10.1016/j.knosys.2016.11.018

Abstract

In many areas, the volume of text data is growing rapidly, which demands efficient text classification approaches. Several methods are currently available, but most either lose performance as the dimensionality of the problem increases or incur high training costs, which limits their application in real scenarios. A method that can process high-dimensional data quickly is therefore needed. In this study, we propose MDLText, an efficient, lightweight, scalable, and fast multinomial text classifier based on the minimum description length principle. MDLText supports fast incremental learning and is sufficiently robust to prevent overfitting, both of which are desirable in real-world applications, large-scale problems, and online scenarios. Our experiments were carefully designed to yield statistically sound results, and they show that the proposed approach achieves a good balance between predictive power and computational efficiency.
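
The abstract does not give the MDLText equations, so the following is only a rough sketch of the general idea behind a description-length classifier with incremental updates, not the authors' formulation. It assumes a simplified rule: a document is assigned to the class whose smoothed token counts encode it in the fewest bits. The class name `MDLSketchClassifier`, the `vocab_size_guess` parameter, and the add-one smoothing are all illustrative assumptions.

```python
import math
from collections import defaultdict


class MDLSketchClassifier:
    """Toy description-length classifier. NOT the published MDLText method."""

    def __init__(self, vocab_size_guess=2**16):
        # Assumed smoothing constant; the paper's actual smoothing is not given here.
        self.vocab_size_guess = vocab_size_guess
        self.token_counts = defaultdict(lambda: defaultdict(int))
        self.total_counts = defaultdict(int)

    def partial_fit(self, tokens, label):
        """Incremental update: only the counts of the given class change."""
        for tok in tokens:
            self.token_counts[label][tok] += 1
            self.total_counts[label] += 1

    def description_length(self, tokens, label):
        """Bits needed to encode the tokens under the class's smoothed counts."""
        counts = self.token_counts[label]
        denom = self.total_counts[label] + self.vocab_size_guess
        return sum(-math.log2((counts.get(tok, 0) + 1) / denom) for tok in tokens)

    def predict(self, tokens):
        """Pick the class that yields the shortest description length."""
        return min(self.total_counts,
                   key=lambda c: self.description_length(tokens, c))


clf = MDLSketchClassifier()
clf.partial_fit("cheap pills buy now".split(), "spam")
clf.partial_fit("meeting agenda for monday".split(), "ham")
first_prediction = clf.predict("buy cheap pills".split())

# A new labeled document arrives later; no retraining from scratch is needed.
clf.partial_fit("lunch on monday".split(), "ham")
second_prediction = clf.predict("monday lunch".split())
```

Because training only increments per-class counters, each new document costs time proportional to its length, which is the kind of property that makes incremental and online use practical in this toy setting.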

Journal: Knowledge-Based Systems
Publication Date: Nov 25, 2016
Citations: 26
