Abstract

Data mining tools often include a workbench of algorithms to model a given dataset, but offer little guidance on which algorithm will be the most accurate for that dataset. The best algorithm is not known in advance, and no single model format is superior for all datasets. Evaluating a number of candidate algorithms on large datasets to determine the most accurate model is, however, a computational burden. An alternative and more time-efficient approach is to select the optimal algorithm based on the nature of the dataset. This meta-learning study explores to what degree dataset characteristics can help identify which regression/estimation algorithm will best fit a given dataset. We focus on comprehensible 'white-box' techniques in particular (i.e. linear, spline, tree, linear tree or spline tree), as these are of particular interest in many real-life estimation settings. A large-scale experiment with more than a thousand so-called datasetoids, representing various real-life dependencies, is conducted to discover possible relations. It is found that algorithm-based characteristics such as sampling landmarks are the major drivers for successfully selecting the most accurate algorithm. Further, data-based characteristics such as the length, dimensionality and composition of the independent variables, or the asymmetry and dispersion of the dependent variable, appear to contribute little once landmarks are included in the meta-model.
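To make the meta-learning setup concrete, the sketch below illustrates the general idea of combining data-based meta-features with sampling landmarks and training a meta-model to recommend an algorithm. This is a minimal illustration under assumed choices (scikit-learn estimators, mean-squared-error landmarks on a subsample, a random-forest meta-classifier); it is not the authors' exact experimental pipeline, and all function names and parameter values are illustrative.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.tree import DecisionTreeRegressor
from sklearn.model_selection import cross_val_score
from sklearn.ensemble import RandomForestClassifier


def meta_features(X, y, sample_size=200, seed=0):
    """Compute a small meta-feature vector for one dataset:
    data-based characteristics plus two sampling landmarks."""
    rng = np.random.default_rng(seed)
    n, d = X.shape

    # Data-based characteristics: size, dimensionality,
    # dispersion and asymmetry of the dependent variable.
    y_std = y.std()
    y_skew = ((y - y.mean()) ** 3).mean() / (y_std ** 3 + 1e-12)

    # Sampling landmarks: cross-validated error of cheap candidate
    # models fitted on a small random subsample of the data.
    idx = rng.choice(n, size=min(sample_size, n), replace=False)
    Xs, ys = X[idx], y[idx]
    lin_lm = -cross_val_score(LinearRegression(), Xs, ys,
                              scoring="neg_mean_squared_error", cv=3).mean()
    tree_lm = -cross_val_score(DecisionTreeRegressor(max_depth=3), Xs, ys,
                               scoring="neg_mean_squared_error", cv=3).mean()

    return np.array([n, d, y_skew, y_std, lin_lm, tree_lm])


def fit_meta_model(datasets, best_algorithm_labels):
    """Meta-level learning: given the meta-features of many datasets (or
    datasetoids) and the label of the algorithm that actually performed
    best on each, train a classifier that recommends an algorithm for a
    new, unseen dataset."""
    M = np.vstack([meta_features(X, y) for X, y in datasets])
    meta_model = RandomForestClassifier(n_estimators=200, random_state=0)
    meta_model.fit(M, best_algorithm_labels)
    return meta_model
```

In this kind of setup, a new dataset is characterized once via `meta_features` and the meta-model predicts which candidate algorithm to train in full, which is far cheaper than cross-validating every candidate on the complete data.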
