Abstract

Studies on the traditional support vector machine (SVM) implicitly assume that the costs of different types of mistakes are the same and minimize the error rate. On the one hand, it is not enough for many practical applications to rely solely on the error rate, which reflects only the average classification ability of a classifier. It is also of great significance to consider the performance of classifiers from the perspective of each sample. On the other hand, many real-world problems, such as credit card fraud detection, intrusion detection, oil-spill detection and cancer diagnosis, usually involve substantially unequal misclassification costs. To solve this problem, many works on the cost-sensitive SVM (CS-SVM) have emerged. The misclassification costs for this model are generally provided by domain experts. Inspired by the concept of the CS-SVM, we propose a new SVM with sample-based misclassification cost invariance with the aim of constructing a relatively reliable classifier. The relatively reliable classifier is defined as the one with low probabilities of finding a classifier that correctly classifies each misclassified sample. Note that the cost is determined by the inherent characteristics of each sample rather than being subjectively assigned, so we denote the proposed classifier as the objective-cost-sensitive SVM (OCS-SVM). The experimental results demonstrate the superiority of the proposed method compared with nine other commonly used classifiers.

Highlights

  • In the field of data classification, the objective of conventional machine learning techniques is to minimize a loss function on a training set to obtain lower misclassification rates [1]

  • Because the misclassification costs are generally provided by domain experts, we refer to them as subjective costs, and the cost-sensitive support vector machine (SVM) based on them is called the subjective-cost-sensitive SVM (SCS-SVM)

  • We aim to find the optimal parameter pair (C, sigma) that minimizes the misclassification cost, thereby achieving the objective of the objective-cost-sensitive SVM (OCS-SVM) to reduce the misclassification costs, and making the OCS-SVM algorithm cost-sensitive to these two parameters
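The search over the parameter pair (C, sigma) described in the last highlight can be sketched as a plain grid search that scores each pair by total misclassification cost instead of error rate. This is a minimal illustration under stated assumptions, not the OCS-SVM algorithm itself: the trainer is a generic bias-free RBF-kernel SVM fitted by dual coordinate ascent, the per-sample costs in `costs_va` are hypothetical placeholders (the paper derives its costs from the characteristics of each sample, which is not reproduced here), and the toy data, grids, and all function names are invented for the example.

```python
import math
from itertools import product


def rbf(a, b, sigma):
    """Gaussian kernel exp(-||a - b||^2 / (2 * sigma^2))."""
    sq = sum((p - q) ** 2 for p, q in zip(a, b))
    return math.exp(-sq / (2.0 * sigma ** 2))


def fit_svm(X, y, C, sigma, sweeps=50):
    """Bias-free soft-margin SVM trained by dual coordinate ascent (a stand-in trainer)."""
    n = len(X)
    K = [[rbf(X[i], X[j], sigma) for j in range(n)] for i in range(n)]
    alpha = [0.0] * n
    for _ in range(sweeps):
        for i in range(n):
            f_i = sum(alpha[j] * y[j] * K[i][j] for j in range(n))
            # Exact one-dimensional maximiser of the dual objective in alpha_i,
            # clipped to the box [0, C]. For the RBF kernel K[i][i] equals 1.
            step = (1.0 - y[i] * f_i) / K[i][i]
            alpha[i] = min(C, max(0.0, alpha[i] + step))
    return alpha


def predict(X_train, y_train, alpha, sigma, x):
    """Sign of the kernel expansion at x (no bias term)."""
    score = sum(a * t * rbf(z, x, sigma) for a, t, z in zip(alpha, y_train, X_train))
    return 1 if score >= 0 else -1


def total_cost(y_true, y_pred, costs):
    """Sum of the per-sample costs over the misclassified samples only."""
    return sum(c for t, p, c in zip(y_true, y_pred, costs) if t != p)


def grid_search(X_tr, y_tr, X_va, y_va, costs_va, C_grid, sigma_grid):
    """Return the (C, sigma) pair with the lowest validation cost, plus all scores."""
    results = {}
    for C, sigma in product(C_grid, sigma_grid):
        alpha = fit_svm(X_tr, y_tr, C, sigma)
        preds = [predict(X_tr, y_tr, alpha, sigma, x) for x in X_va]
        results[(C, sigma)] = total_cost(y_va, preds, costs_va)
    best = min(results, key=results.get)
    return best, results


# Toy data and hypothetical per-sample costs, for illustration only.
X_tr = [(0.0, 0.0), (0.2, 0.1), (0.1, 0.3), (1.0, 1.0), (0.9, 1.2), (1.1, 0.8)]
y_tr = [-1, -1, -1, 1, 1, 1]
X_va = [(0.1, 0.1), (1.0, 0.9), (0.5, 0.6), (0.3, 0.2)]
y_va = [-1, 1, 1, -1]
costs_va = [1.0, 5.0, 2.0, 1.0]

C_grid = [0.1, 1.0, 10.0]
sigma_grid = [0.1, 0.5, 2.0]

best, results = grid_search(X_tr, y_tr, X_va, y_va, costs_va, C_grid, sigma_grid)
print("best (C, sigma):", best, "validation cost:", results[best])
```

Scoring by `total_cost` rather than by the number of errors is what makes the selection cost-sensitive: two parameter pairs with the same error count can still rank differently when the samples they misclassify carry different costs.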

Summary

INTRODUCTION

In the field of data classification, the objective of conventional machine learning techniques is to minimize a loss function on a training set to obtain lower misclassification rates [1]. The SVM has been widely used in a variety of practical classification tasks, such as disease diagnosis [7], bioinformatics [8], and intrusion detection [9]. Most of these studies implicitly assume that the costs of different types of mistakes are the same and minimize the error rate. Inspired by the concept of SCS-SVMs, we propose a new SVM with sample-based misclassification cost invariance with the aim of constructing a relatively reliable classifier. A novel classifier with reliability (OCS-SVM) is proposed, defined as one for which the probability of finding a classifier that correctly classifies its misclassified samples is low.

RELATED WORKS
OBJECTIVE
CLASSIFIER WITH RELIABILITY
THE OCS-SVM ALGORITHM
CONCLUSION AND FUTURE WORK