Abstract
In machine learning, hyperparameter tuning is a powerful way to improve model performance. Our research focuses on classifying imbalanced data with cost-sensitive support vector machines (SVMs). We propose a multi-objective approach, devised for imbalanced data, that optimizes the model's hyperparameters by jointly optimizing three SVM performance measures. We present the algorithm in a basic version based on genetic algorithms and in an improved version that combines genetic algorithms with decision trees. We tested both versions on benchmark datasets, in serial and parallel implementations. The improved version strongly reduces the computational time needed to find optimized hyperparameters. The results empirically show that suitable evaluation measures should be used when assessing the classification performance of models trained on imbalanced data.
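The abstract refers to cost-sensitive SVMs, in which misclassifying a minority-class (positive) instance is penalized more heavily than misclassifying a majority-class one. A minimal sketch of the underlying cost idea, assuming a 2×2 cost matrix with illustrative cost values not taken from the paper:

```python
# Illustrative cost-sensitive evaluation: the total misclassification cost
# weights false negatives (missed minority cases) more than false positives.
# The cost values c_fn and c_fp below are hypothetical, not from the paper.

def total_cost(y_true, y_pred, c_fn=5.0, c_fp=1.0):
    """Sum per-error costs over paired true/predicted binary labels (1 = minority)."""
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    return c_fn * fn + c_fp * fp

y_true = [1, 1, 0, 0, 0, 0]
y_pred = [0, 1, 0, 1, 0, 0]
print(total_cost(y_true, y_pred))  # 1 FN * 5.0 + 1 FP * 1.0 = 6.0
```

A cost-sensitive learner minimizes this kind of asymmetric cost rather than the plain error count, which is what makes it suitable for imbalanced data.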
Highlights
Classification problems may be encountered in different domains
The benchmark datasets are related to medical diagnosis, represented as binary classification problems, and have different sample sizes, attributes, and imbalance ratios (IR), defined as m/M (Amin et al 2016), where m is the number of minority instances and M is the number of majority instances
As with other machine learning (ML) techniques, SVM performance depends on hyperparameters
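The imbalance ratio defined in the highlights (IR = m/M, the minority count over the majority count) can be computed directly from the class labels. A small stdlib-only sketch, assuming binary labels:

```python
from collections import Counter

def imbalance_ratio(labels):
    """IR = m / M, where m = minority-class count and M = majority-class count."""
    counts = Counter(labels)
    m = min(counts.values())
    M = max(counts.values())
    return m / M

labels = [0] * 90 + [1] * 10  # 10 minority vs. 90 majority instances
print(imbalance_ratio(labels))  # 10 / 90 ≈ 0.111
```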
Summary
Classification problems may be encountered in different domains. One of these is disease diagnosis, which establishes the presence or absence of a given disease from reported symptoms and the results of medical exams. In (Guido et al 2021), we tested two model evaluation metrics, accuracy and G-Mean, on two imbalanced benchmark datasets, optimizing the hyperparameters of support vector machines with genetic algorithms (GAs). Prior work has performed experimental analyses of class imbalance and cost-sensitive learning with given class and example costs, showing that the proposed algorithms provide superior generalization performance compared to conventional methods. Qi et al (2013) proposed a new cost-sensitive Laplacian SVM, tested its effectiveness in experiments on public datasets, and evaluated the algorithm's performance by the average cost. Noia et al (2020) applied SVM, k-nearest neighbors, and k-means clustering to predict the probability of contracting a given disease, starting from both workplace-related characteristics (using Ateco and Istat codes) and worker-related characteristics (i.e., age at hiring, age at disease certification, gender, and employment duration); they used a GA to find the best parameter values for these methods. The most commonly used evaluation measures are accuracy, precision, recall, F-score, and the Receiver Operating Characteristic (ROC)
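The summary lists accuracy, precision, recall, F-score, and ROC as common evaluation measures, and the paper also uses G-Mean. A stdlib-only sketch (binary labels, 1 = minority/positive class) illustrating why accuracy can be misleading on imbalanced data while G-Mean is not:

```python
import math

def binary_metrics(y_true, y_pred):
    """Accuracy and G-Mean from confusion counts (1 = positive/minority class)."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    accuracy = (tp + tn) / len(y_true)
    sensitivity = tp / (tp + fn) if tp + fn else 0.0  # recall on the minority class
    specificity = tn / (tn + fp) if tn + fp else 0.0  # recall on the majority class
    g_mean = math.sqrt(sensitivity * specificity)
    return accuracy, g_mean

# A degenerate classifier that always predicts the majority class (0):
y_true = [0] * 95 + [1] * 5
y_pred = [0] * 100
acc, gm = binary_metrics(y_true, y_pred)
print(acc, gm)  # accuracy = 0.95 looks good, but G-Mean = 0.0 exposes the failure
```

Because G-Mean is the geometric mean of the per-class recalls, it collapses to zero whenever one class is ignored entirely, which is exactly the failure mode accuracy hides on imbalanced data.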