Developing New Fitness Functions in Genetic Programming for Classification With Unbalanced Data

U Bhowan,M Johnston,Mengjie Zhang Mengjie Zhang

doi:10.1109/tsmcb.2011.2167144

Abstract

Machine learning algorithms such as genetic programming (GP) can evolve biased classifiers when data sets are unbalanced. Data sets are unbalanced when at least one class is represented by only a small number of training examples (called the minority class) while other classes make up the majority. In this scenario, classifiers can have good accuracy on the majority class but very poor accuracy on the minority class(es) due to the influence that the larger majority class has on traditional training criteria in the fitness function. This paper aims to both highlight the limitations of the current GP approaches in this area and develop several new fitness functions for binary classification with unbalanced data. Using a range of real-world classification problems with class imbalance, we empirically show that these new fitness functions evolve classifiers with good performance on both the minority and majority classes. Our approaches use the original unbalanced training data in the GP learning process, without the need to artificially balance the training examples from the two classes (e.g., via sampling).

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Developing New Fitness Functions in Genetic Programming for Classification With Unbalanced Data

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)

Lead the way for us

Journal: IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)	Publication Date: Sep 26, 2011
Citations: 150

Similar Papers

Fitness Functions in Genetic Programming for Classification with Unbalanced Data
Mengjie Zhang ... Grant Patterson
-
Mengjie Zhang, et. al.Mengjie Zhang ... Grant Patterson
02 Dec 2007
02 Dec 2007

Improving Fitness Functions in Genetic Programming for Classification on Unbalanced Credit Card Data
Nhien-An Le-Khac ... Miguel Nicolau
-
Nhien-An Le-Khac, et. al.Nhien-An Le-Khac ... Miguel Nicolau
01 Jan 2015
01 Jan 2015

Evolving Diverse Ensembles Using Genetic Programming for Classification With Unbalanced Data
Mark Johnston ... Urvesh Bhowan
IEEE Transactions on Evolutionary Computation | VOL. 17
Mark Johnston, et. al.Mark Johnston ... Urvesh Bhowan
01 Jun 2013
IEEE Transactions on Evolutionary Computation | VOL. 17

Differentiating between individual class performance in Genetic Programming fitness for classification with unbalanced data
Mengjie Zhang ... Mark Johnston
-
Mengjie Zhang, et. al.Mengjie Zhang ... Mark Johnston
01 May 2009
01 May 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Developing New Fitness Functions in Genetic Programming for Classification With Unbalanced Data

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)