Kernel-Based Ensemble Learning in Python

Benjamin Guedj, Bhargav Srinivasa Desikan

doi:10.3390/info11020063

Abstract

We propose a new supervised learning algorithm for classification and regression problems where two or more preliminary predictors are available. We introduce KernelCobra, a non-linear learning strategy for combining an arbitrary number of initial predictors. KernelCobra builds on the COBRA algorithm introduced by Biau et al. (2016), which combined estimators based on a notion of proximity of predictions on the training data. While the COBRA algorithm used a binary threshold to declare which training data were close and to be used, we generalise this idea by using a kernel to better encapsulate the proximity information. Such a smoothing kernel provides more representative weights to each of the training points which are used to build the aggregate and final predictor, and KernelCobra systematically outperforms the COBRA algorithm. While COBRA is intended for regression, KernelCobra deals with classification and regression. KernelCobra is included as part of the open source Python package Pycobra (0.2.4 and onward), introduced by Srinivasa Desikan (2018). Numerical experiments were undertaken to assess the performance (in terms of pure prediction and computational complexity) of KernelCobra on real-life and synthetic datasets.

Highlights

In the fields of machine learning and statistical learning, ensemble methods consist of combining several estimators to create a new, superior estimator.
The KernelCobra algorithm introduced in the present paper aims to smooth COBRA's data point selection process by introducing a kernel-based method for assigning weights to the points in the collective.
We focus in the present paper on the introduction of KernelCobra and its variants, and on its implementation in Python.

Summary

Introduction

In the fields of machine learning and statistical learning, ensemble methods consist of combining several estimators (or predictors) to create a new, superior estimator. Our method (KernelCobra) extends the COBRA (standing for combined regression alternative) algorithm introduced by Biau et al. [6]. The COBRA algorithm is motivated by the idea that non-linear, data-dependent techniques can provide flexibility not offered by existing (linear) ensemble methods. Using the proximity between predictions on the training data and predictions on test data, training points are collected to form the aggregate. The COBRA algorithm selects training points by checking whether the proximity is less than a data-dependent threshold ε, resulting in a binary decision (either keep the point or discard it). The KernelCobra algorithm introduced in the present paper aims to smooth this data point selection process by introducing a kernel-based method for assigning weights to the points in the collective.
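
The contrast between the binary threshold and kernel-based weights can be illustrated with a minimal NumPy sketch. It is not the Pycobra implementation: the function names, the Gaussian kernel, the summed absolute discrepancy used as distance, and the `bandwidth` parameter are all assumptions made for illustration, and the hard rule is simplified to require agreement from every preliminary predictor.

```python
import numpy as np


def hard_threshold_weights(train_preds, query_preds, eps):
    """COBRA-style binary weights (simplified, hypothetical helper).

    train_preds: (n, M) outputs of M preliminary predictors on n training points.
    query_preds: (M,) outputs of the same predictors on the query point.
    A training point is kept only if every predictor's output on it lies
    within eps of that predictor's output on the query point.
    """
    close = np.abs(train_preds - query_preds) <= eps
    keep = close.all(axis=1).astype(float)
    total = keep.sum()
    # If no point is retained, return all-zero weights instead of dividing by 0.
    return keep / total if total > 0 else keep


def kernel_weights(train_preds, query_preds, bandwidth):
    """Smooth weights (assumed form): Gaussian kernel on the summed
    absolute discrepancy between predictions."""
    dist = np.abs(train_preds - query_preds).sum(axis=1)
    w = np.exp(-((dist / bandwidth) ** 2))
    return w / w.sum()


def aggregate(weights, y_train):
    """Weighted average of the training responses."""
    return float(weights @ y_train)


# Toy data: three training points, two preliminary predictors.
train_preds = np.array([[1.0, 1.1], [2.0, 2.1], [1.2, 0.9]])
y_train = np.array([1.0, 2.0, 1.1])
query_preds = np.array([1.1, 1.0])

w_hard = hard_threshold_weights(train_preds, query_preds, eps=0.15)
w_soft = kernel_weights(train_preds, query_preds, bandwidth=0.5)

print("hard weights:", w_hard, "prediction:", aggregate(w_hard, y_train))
print("soft weights:", w_soft, "prediction:", aggregate(w_soft, y_train))
```

With the hard rule, the second training point is discarded outright. With the kernel it still receives a small positive weight, so the aggregate varies smoothly as the predictions move instead of jumping when a point crosses the threshold.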

Related Work

KernelCobra: A Kernelized Version of COBRA

The Unsupervised Setting

Classification

Implementation

Numerical Experiments

Conclusions and Future Work

Journal: Information	Publication Date: Jan 25, 2020
Citations: 4	License type: CC BY 4.0
