On the cost-effectiveness of neural and non-neural approaches and representations for text classification: A comprehensive comparative study

Washington Cunha,Vítor Mangaravite,Christian Gomes,Sérgio Canuto,Elaine Resende,Cecilia Nascimento,Felipe Viegas,Celso França,Wellington Santos Martins,Jussara M Almeida,Thierson Rosa,Leonardo Rocha,Marcos André Gonçalves

doi:10.1016/j.ipm.2020.102481

Abstract

This article brings two major contributions. First, we present the results of a critical analysis of recent scientific articles about neural and non-neural approaches and representations for automatic text classification (ATC). This analysis is focused on assessing the scientific rigor of such studies. It reveals a profusion of potential issues related to the experimental procedures including: (i) use of inadequate experimental protocols, including no repetitions for the sake of assessing variability and generalization; (ii) lack of statistical treatment of the results; (iii) lack of details on hyperparameter tuning, especially of the baselines; (iv) use of inadequate measures of classification effectiveness (e.g., accuracy with skewed distributions). Second, we provide some organization and ground to the field by performing a comprehensive and scientifically sound comparison of recent neural and non-neural ATC solutions. Our study provides a more complete picture by looking beyond classification effectiveness, taking the trade-off between model costs (i.e., training time) into account. Our evaluation is guided by scientific rigor, which, as our literature review shows, is missing in a large body of work. Our experimental results, based on more than 1500 measurements, reveal that in the smaller datasets, the simplest and cheaper non-neural methods are among the best performers. In the larger datasets, neural Transformers perform better in terms of classification effectiveness. However, when compared to the best (properly tuned) non-neural solutions, the gains in effectiveness are not very expressive, especially considering the much longer training times (up to 23x slower). Our findings call for a self-reflection of best practices in the field, from the way experiments are conducted and analyzed to the choice of proper baselines for each situation and scenario.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

On the cost-effectiveness of neural and non-neural approaches and representations for text classification: A comprehensive comparative study

Abstract

Talk to us

Similar Papers

More From: Information Processing & Management

Lead the way for us

Journal: Information Processing & Management	Publication Date: Feb 5, 2021
Citations: 43

Similar Papers

Analysis on the Deep Learning Method is Used for Text Classification
Zhuocheng Zhang
-
Zhuocheng ZhangZhuocheng Zhang
01 Oct 2021
01 Oct 2021

Automatic Classification of Government Texts Based on Improved CNN and Skip-gram Models
Mingxi Wen ... Hao Wu
-
Mingxi Wen, et. al.Mingxi Wen ... Hao Wu
01 Nov 2021
01 Nov 2021

Editor's evaluation: Position representations of moving objects align with real-time position in the early visual response
Clare Press
-
Clare PressClare Press
28 Sep 2022
28 Sep 2022

Author response: Position representations of moving objects align with real-time position in the early visual response
Philippa Anne Johnson ... Tessel Blom
-
Philippa Anne Johnson, et. al.Philippa Anne Johnson ... Tessel Blom
09 Nov 2022
09 Nov 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On the cost-effectiveness of neural and non-neural approaches and representations for text classification: A comprehensive comparative study

Abstract

Talk to us

Similar Papers

More From: Information Processing &amp; Management

More From: Information Processing & Management