This paper studies the problem of constructing a robust nonlinear classifier when the data set involves uncertainty and only the first- and second-order moments are known a priori. A distributionally robust chance-constrained kernel-free quadratic surface support vector machine (SVM) model is proposed using the moment information of the uncertain data. The proposed model is reformulated as a semidefinite programming problem and a second-order cone programming problem for efficient computations. A geometric interpretation of the proposed model is also provided. For commonly used data without prescribed uncertainty, a cluster-based data-driven approach is introduced to retrieve the hidden moment information that enables the proposed model for robust classification. Extensive computational experiments using synthetic and public benchmark data sets with or without uncertainty involved support the superior performance of the proposed model over other state-of-the-art SVM models, particularly when the data sets are massive and/or imbalanced.
Read full abstract