Abstract

Ensemble-based methods are popular approaches that increase the accuracy of a decision by aggregating the opinions of individual voters. The common aim is to maximize accuracy; however, a natural limitation arises when incremental costs are also assigned to the individual voters. Consequently, we investigate creating ensembles under an additional constraint on the total cost of the members. This task can be formulated as a knapsack problem, where the energy is the ensemble accuracy formed by some aggregation rule. However, the generally applied aggregation rules lead to a nonseparable energy function, which rules out the common solution tools, such as dynamic programming. We introduce a novel stochastic approach that considers the energy as the joint probability function of the member accuracies. This type of knowledge can be efficiently incorporated into a stochastic search process as a stopping rule, since we have information on the expected accuracy or, alternatively, on the probability of finding more accurate ensembles. Experimental analyses of the created ensembles of pattern classifiers and object detectors confirm the efficiency of our approach over other pruning methods. Moreover, we propose a novel stochastic search method that better fits the energy and can be incorporated into other stochastic strategies as well.
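To make the knapsack formulation concrete, the following hypothetical sketch selects a majority-voting ensemble under a total cost budget by exhaustive search. All names (`majority_vote_accuracy`, `best_ensemble_exhaustive`) are illustrative and not from the paper; the point is that the energy (ensemble accuracy under majority voting) is not a sum of per-member terms, so the classic dynamic-programming recursion for knapsack does not apply.

```python
import itertools

def majority_vote_accuracy(member_outputs, labels):
    """Fraction of samples on which the majority of members is correct."""
    n_correct = 0
    for i, y in enumerate(labels):
        votes = sum(1 for out in member_outputs if out[i] == y)
        if 2 * votes > len(member_outputs):
            n_correct += 1
    return n_correct / len(labels)

def best_ensemble_exhaustive(outputs, labels, costs, budget):
    """Exhaustive search over member subsets within the cost budget.
    Feasible only for small pools; shown here to illustrate the
    nonseparable energy, not as a practical solver."""
    best_acc, best_subset = 0.0, ()
    for r in range(1, len(outputs) + 1):
        for subset in itertools.combinations(range(len(outputs)), r):
            if sum(costs[j] for j in subset) <= budget:
                acc = majority_vote_accuracy(
                    [outputs[j] for j in subset], labels)
                if acc > best_acc:
                    best_acc, best_subset = acc, subset
    return best_subset, best_acc
```

Note that three members that are each 75% accurate individually can reach 100% jointly when their errors fall on different samples, which is exactly why the energy cannot be decomposed member by member.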

Highlights

  • Ensemble-based systems are rather popular in several application fields and are employed to increase the decision accuracy of individual approaches

  • We show that this type of knowledge can be efficiently incorporated into any stochastic search process as a stopping rule, since we have information on the expected accuracy or, alternatively, on the probability of finding more accurate ensembles

  • We estimate the distribution of q in terms of its mean and variance. This information can be efficiently incorporated as a stopping rule in stochastic search algorithms, as we demonstrate, e.g., for simulated annealing (SA)
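The highlights above describe estimating the mean and variance of the ensemble accuracy q and using them as a stopping rule in a stochastic search. The sketch below is a minimal illustration of that idea grafted onto a generic simulated-annealing loop; it assumes a normal approximation for the distribution of q and threshold names (`p_stop`, `n_probe`) of our own choosing, so it should be read as one plausible instantiation, not the paper's exact estimator.

```python
import math
import random

def normal_tail(x, mu, sigma):
    """P(Q > x) under a normal approximation N(mu, sigma^2)."""
    if sigma == 0.0:
        return 0.0
    return 0.5 * math.erfc((x - mu) / (sigma * math.sqrt(2)))

def sa_with_stopping_rule(energy, random_solution, neighbor,
                          n_probe=200, t0=1.0, cooling=0.95,
                          p_stop=1e-3, max_iter=10000, seed=0):
    rng = random.Random(seed)
    # Probe the search space to estimate the mean and variance of the
    # energy (ensemble accuracy q) over random feasible ensembles.
    probes = [energy(random_solution(rng)) for _ in range(n_probe)]
    mu = sum(probes) / n_probe
    var = sum((p - mu) ** 2 for p in probes) / (n_probe - 1)
    sigma = math.sqrt(var)

    cur = random_solution(rng)
    cur_e = energy(cur)
    best, best_e = cur, cur_e
    t = t0
    for _ in range(max_iter):
        # Stopping rule: halt once the estimated probability of drawing
        # an ensemble more accurate than the best found so far falls
        # below the threshold p_stop.
        if normal_tail(best_e, mu, sigma) < p_stop:
            break
        cand = neighbor(cur, rng)
        cand_e = energy(cand)
        # Maximizing accuracy: always accept improvements; accept worse
        # moves with the usual Boltzmann probability.
        if cand_e >= cur_e or rng.random() < math.exp((cand_e - cur_e) / t):
            cur, cur_e = cand, cand_e
            if cur_e > best_e:
                best, best_e = cur, cur_e
        t = max(t * cooling, 1e-12)  # avoid division by zero as t -> 0
    return best, best_e
```

The stopping rule is search-strategy agnostic: the same tail-probability check can wrap any iterative optimizer that tracks a best-so-far energy.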


Summary

Introduction

Ensemble-based systems are popular in several application fields and are employed to increase the decision accuracy of individual approaches. There are also efforts to extend the basic ensemble pruning models to account for resource constraints such as training/test execution time or memory/storage space (Buciluǎ et al. 2006; Hinton et al. 2015). A popular approach to this end is to apply multi-objective evolutionary algorithms, like NSGA-II (Deb et al. 2002). Besides its individual accuracy and cost, we compute a usefulness value for each candidate member during the selection process, reflecting its direct contribution to the objective function, which in our case is based on the majority voting rule.
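A usefulness value of the kind described above can be sketched as the marginal gain in majority-voting accuracy a candidate brings to the current ensemble, normalized by its cost. The function name and the per-unit-cost normalization below are our illustrative assumptions, not the paper's exact definition.

```python
def usefulness(candidate_out, ensemble_outs, labels, cost):
    """Hypothetical usefulness score: change in majority-voting
    accuracy when the candidate joins the current ensemble,
    normalized by the candidate's cost."""
    def mv_acc(outs):
        correct = 0
        for i, y in enumerate(labels):
            votes = sum(1 for o in outs if o[i] == y)
            if 2 * votes > len(outs):
                correct += 1
        return correct / len(labels)

    gain = mv_acc(ensemble_outs + [candidate_out]) - mv_acc(ensemble_outs)
    return gain / cost
```

A cheap member that breaks ties in the current ensemble's favor can thus score higher than a more accurate but expensive one, which is the behavior a cost-aware selection step wants to reward.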

Basic concepts and notation
Deterministic selection strategies
Stochastic search algorithms
Stochastic estimation of ensemble energy
Estimation of the distribution of member accuracies
Stopping rule for ensemble selection
Empirical analysis
Kaggle challenges
Binary classification problems
Discussion
Findings