Instance spaces for machine learning classification

Mario A Muñoz, Kate Smith-Miles, Davaatseren Baatar, Laura Villanova

doi:10.1007/s10994-017-5629-5

Abstract

This paper tackles the issue of objective performance evaluation of machine learning classifiers, and the impact of the choice of test instances. Given that statistical properties or features of a dataset affect the difficulty of an instance for particular classification algorithms, we examine the diversity and quality of the UCI repository of test instances used by most machine learning researchers. We show how an instance space can be visualized, with each classification dataset represented as a point in the space. The instance space is constructed to reveal pockets of hard and easy instances, and enables the strengths and weaknesses of individual classifiers to be identified. Finally, we propose a methodology to generate new test instances with the aim of enriching the diversity of the instance space, enabling potentially greater insights than can be afforded by the current UCI repository.
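As a rough illustration of the idea that each classification dataset becomes a single point in an instance space, the sketch below summarizes toy datasets by two meta-features and standardizes them into 2-D coordinates. It is not the authors' method: the abstract does not specify which features or which projection are used, so the two meta-features (normalized class entropy and the feature-to-instance ratio), the z-scoring step, and all function and dataset names here are assumptions chosen for illustration.

```python
import math
from collections import Counter
from statistics import mean, pstdev


def class_entropy(labels):
    """Shannon entropy of the class distribution, normalized to [0, 1]."""
    counts = Counter(labels)
    n = len(labels)
    if len(counts) < 2:
        return 0.0
    h = -sum((c / n) * math.log2(c / n) for c in counts.values())
    return h / math.log2(len(counts))


def meta_features(rows, labels):
    """Two illustrative meta-features for one dataset (assumed, not from the paper)."""
    n_instances = len(rows)
    n_features = len(rows[0])
    return (class_entropy(labels), n_features / n_instances)


def instance_space(datasets):
    """Map each named dataset to a 2-D point of z-scored meta-features."""
    names = list(datasets)
    raw = [meta_features(*datasets[name]) for name in names]
    coords = []
    for column in zip(*raw):
        mu, sigma = mean(column), pstdev(column)
        coords.append([(v - mu) / sigma if sigma > 0 else 0.0 for v in column])
    return {name: (coords[0][i], coords[1][i]) for i, name in enumerate(names)}


# Hypothetical toy datasets: (feature rows, class labels).
toy = {
    "balanced": ([[0.1, 1.0], [0.9, 0.2], [0.4, 0.7], [0.8, 0.3]], [0, 1, 0, 1]),
    "skewed": ([[0.1, 1.0], [0.9, 0.2], [0.4, 0.7], [0.8, 0.3]], [0, 0, 0, 1]),
    "wide": ([[0.1, 1.0, 0.5, 0.3], [0.9, 0.2, 0.1, 0.6]], [0, 1]),
}
space = instance_space(toy)
for name, (x, y) in space.items():
    print(f"{name}: ({x:+.2f}, {y:+.2f})")
```

In a fuller analysis one would presumably compute many more meta-features and learn a projection to two dimensions; plotting classifier performance at each point would then show where the hard and easy datasets cluster.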

Journal: Machine Learning
Publication Date: Dec 28, 2017
Citations: 108

Similar Papers

Generating new test instances by evolving in instance space
Kate Smith-Miles ... Simon Bowly
Computers & Operations Research | VOL. 63
09 May 2015

An Instance Space Analysis of Regression Problems
Mario Andrés Muñoz ... Gisele L Pappa
ACM Transactions on Knowledge Discovery from Data | VOL. 15
27 Mar 2021

Evaluating regression algorithms at the instance level using item response theory
João V.C Moraes ... Ricardo B.C Prudêncio
Knowledge-Based Systems | VOL. 240
04 Jan 2022

Revisiting where are the hard knapsack problems? via Instance Space Analysis
Kate Smith-Miles ... Mario Andrés Muñoz
Computers & Operations Research | VOL. 128
18 Dec 2020
