Large Scale Kernel Methods for Online AUC Maximization

Yi Ding, Steven C.H. Hoi, Peilin Zhao, Chenghao Liu

doi:10.1109/icdm.2017.18

Abstract

Learning to optimize AUC performance for classifying label-imbalanced data in online scenarios has been extensively studied in recent years. Most existing work addresses the problem directly in the original feature space, which may not be suitable for non-linearly separable datasets. To address this issue, kernel-based learning methods have been proposed. However, such kernel approaches have been shown to be inefficient and fail to scale to large datasets in practice. Motivated by this, we explore two scalable kernel-based learning techniques as surrogates for existing approaches, random Fourier features and the Nyström method, and offer insights into the differences between the two methods based on their online performance. In contrast to conventional kernel-based learning methods, which suffer from the high computational complexity of the kernel matrix, our proposed approaches alleviate this issue with linear features that approximate the kernel function/matrix. Specifically, two surrogate kernel-based learning models are presented for the online AUC maximization task: (i) the Fourier Online AUC Maximization (FOAM) algorithm, which samples basis functions from a data-independent distribution to approximate the kernel function; and (ii) the Nyström Online AUC Maximization (NOAM) algorithm, which samples a subset of instances from the training data to approximate the kernel matrix by a low-rank matrix. Another novelty of this work is a mini-batch Online Gradient Descent method for model updating, which controls the noise and reduces the variance of the gradients. We provide theoretical analyses for the two proposed algorithms. Empirical studies on commonly used large-scale datasets show that the proposed algorithms outperform existing state-of-the-art methods in terms of both AUC performance and computational efficiency.
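The two kernel approximations named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation of FOAM or NOAM: it assumes an RBF kernel exp(-gamma * ||x - y||^2), and the function names (`rbf_kernel`, `fourier_features`, `nystrom_features`) and parameters are hypothetical, chosen only for illustration. It shows only how each method produces explicit features whose inner products approximate the kernel; the online AUC update itself is not covered.

```python
import numpy as np

def rbf_kernel(X, Y, gamma):
    """Exact RBF kernel matrix: exp(-gamma * ||x - y||^2)."""
    sq = (X ** 2).sum(1)[:, None] + (Y ** 2).sum(1)[None, :] - 2.0 * X @ Y.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def fourier_features(X, n_features, gamma, rng):
    """Random Fourier features (data-independent sampling).

    For the RBF kernel above, frequencies are drawn from N(0, 2*gamma*I)
    and phases uniformly from [0, 2*pi).
    """
    d = X.shape[1]
    W = rng.normal(scale=np.sqrt(2.0 * gamma), size=(d, n_features))
    b = rng.uniform(0.0, 2.0 * np.pi, size=n_features)
    return np.sqrt(2.0 / n_features) * np.cos(X @ W + b)

def nystrom_features(X, landmarks, gamma):
    """Nystrom features (data-dependent sampling).

    `landmarks` is a subset of training instances. The returned Z satisfies
    Z @ Z.T == K_nm @ pinv(K_mm) @ K_nm.T, a low-rank kernel approximation.
    """
    K_mm = rbf_kernel(landmarks, landmarks, gamma)
    K_nm = rbf_kernel(X, landmarks, gamma)
    vals, vecs = np.linalg.eigh(K_mm)
    keep = vals > 1e-10  # drop numerically null directions
    return (K_nm @ vecs[:, keep]) / np.sqrt(vals[keep])
```

In both cases a linear model trained on the returned features stands in for a kernel model, so each update costs time proportional to the feature dimension rather than to the number of stored support vectors.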

Publication Date: Nov 1, 2017
Citations: 55
License type: CC BY-NC-ND

Similar Papers

A maximum entropy-driven support vector classification model for seismic collapse fragility curves estimation of reinforced concrete frame structures
Yu Zhou ... Huan Luo
Structures | VOL. 65
17 Jun 2024

Multi-robot Formation Control with Kernel-based Reinforcement Learning
Jun Wu ... Xin Xu
ROBOT | VOL. 33
05 Aug 2011

Reconstructing seismic response demands across multiple tall buildings using kernel‐based machine learning methods
Han Sun ... John Wallace
Structural Control and Health Monitoring | VOL. 26
23 Apr 2019

Fast kernel independent component analysis with Nyström method
He Wang ... Weixia Xu
01 Nov 2016
