Abstract

Support vector machines (SVMs) are supervised learning models traditionally employed for classification and regression analysis. In classification analysis, a set of training data is chosen, and each instance in the training data is assigned a categorical class. An SVM then constructs a model based on a separating plane that maximizes the margin between different classes. Despite being one of the most popular classification models because of its strong performance empirically, understanding the knowledge captured in an SVM remains difficult. SVMs are typically applied in a black-box manner where the details of parameter tuning, training, and even the final constructed model are hidden from the users. This is natural since these details are often complex and difficult to understand without proper visualization tools. However, such an approach often brings about various problems including trial-and-error tuning and suspicious users who are forced to trust these models blindly.The contribution of this paper is a visual analysis approach for building SVMs in an open-box manner. Our goal is to improve an analyst’s understanding of the SVM modeling process through a suite of visualization techniques that allow users to have full interactive visual control over the entire SVM training process. Our visual exploration tools have been developed to enable intuitive parameter tuning, training data manipulation, and rule extraction as part of the SVM training process. To demonstrate the efficacy of our approach, we conduct a case study using a real-world robot control dataset.

Highlights

  • A support vector machine (SVM) [1] is a supervised learning method widely used in a variety of application areas, such as text analysis [2], computer vision [3], and bioinformatics [4, 5]

  • While SVMs have been shown to have high accuracy in classification [6], they face a variety of challenges when we want to use them for data analytics

  • Q3: How can we help the user to interpret and manipulate the prediction results in a user-friendly way? This paper presents our efforts in opening the black box of model building and knowledge extraction from SVMs

Read more

Summary

Introduction

A support vector machine (SVM) [1] is a supervised learning method widely used in a variety of application areas, such as text analysis [2], computer vision [3], and bioinformatics [4, 5]. Other outputs that can be retrieved from the trained SVM model are a vector that represents the feature weights, and a set of training data instances called support vectors These outputs are unable to provide any insights for domain users who want to better understand. Previous work by Tzeng and Ma [7] indicates that users can gain much insight if allowed to apply function-based techniques that can be explained and validated Such function-based methods enable the interpretation of the classification process with respect to the application domain. The importance of gaining such insight has motivated data mining algorithms [8, 9] that try to extract if--structured rules from the classification model Another issue that makes gaining insight from SVMs difficult is the use of non-linear kernels [10] which typically improve the classification accuracy.

Support vector machines
Visual exploration of high-dimensional data
Visual classification
An introduction to SVM classification
EasySVM
Open-box visual modeling of linear SVM
Visualization of training data and the SVM model
Visual exploration of the projected scatterplot
Visual local SVM building
Visual exploration of decision boundaries
Visual local model building process
Visualization and interactions of multiple models
Visual rule extraction
System implementation
Wall-following Robot Navigation dataset
Findings
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call