Improving the precision-recall trade-off in undersampling-based binary text categorization using unanimity rule

Zafer Erenel,Hakan Altınçay

doi:10.1007/s00521-012-1056-5

Abstract

The distribution of documents over two classes in binary text categorization problem is generally uneven where resampling approaches are shown to improve F1 scores. The improvement achieved is mainly due to the gain in recall where precision may deteriorate. Since precision is the primary concern in some applications, achieving higher F1 scores with a desired level of trade-off between precision and recall is important. In this study, we present an analytical comparison between unanimity and majority voting rules. It is shown that unanimity rule can provide better F1 scores compared to majority voting when an ensemble of high recall but low precision classifiers is considered. Then, category-based undersampling is proposed to generate high recall members. The experiments conducted on three datasets have shown that superior F1 scores can be realized compared to the support vector machines(SVM)-based baseline system and voting over a random undersampling-based ensemble.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Improving the precision-recall trade-off in undersampling-based binary text categorization using unanimity rule

Abstract

Talk to us

Similar Papers

More From: Neural Computing and Applications

Lead the way for us

Journal: Neural Computing and Applications	Publication Date: Jul 11, 2012
Citations: 77

Similar Papers

Multiple Support Vector Machines for Binary Text Classification Based on Sliding Window Technique
Aisha Rashed Albqmi ... Yue Xu
-
Aisha Rashed Albqmi, et. al.Aisha Rashed Albqmi ... Yue Xu
01 Jan 2019
01 Jan 2019

How Group Size and Decision Rules Impact Risk Preferences: Comparing group and individual settings in lottery-choice experiments
Masao Fukutomi ... Yohei Mitani
Journal of Behavioral and Experimental Economics | VOL. 98
Masao Fukutomi, et. al.Masao Fukutomi ... Yohei Mitani
23 Mar 2022
Journal of Behavioral and Experimental Economics | VOL. 98

A Theory of Organizational Dynamics: Internal Politics and Efficiency
Hongbin Cai ... Hong Feng
SSRN Electronic Journal | VOL. -
Hongbin Cai, et. al.Hongbin Cai ... Hong Feng
10 May 2007
SSRN Electronic Journal | VOL. -

Variable Competence and Collective Performance: Unanimity Versus Simple Majority Rule
Eyal Baharad ... Shmuel Nitzan
Group Decision and Negotiation | VOL. 29
Eyal Baharad, et. al.Eyal Baharad ... Shmuel Nitzan
15 Nov 2019
Group Decision and Negotiation | VOL. 29

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving the precision-recall trade-off in undersampling-based binary text categorization using unanimity rule

Abstract

Talk to us

Similar Papers

More From: Neural Computing and Applications