On the influence of the number of algorithms, problems, and independent runs in the comparison of evolutionary algorithms

Niki Veček, Matej Črepinšek, Marjan Mernik

doi:10.1016/j.asoc.2017.01.011

Abstract

When multiple algorithms are compared on multiple optimisation problems, the number of algorithms, the number of problems, and even the number of independent runs can be expected to affect the final conclusions. This research asks to what extent these three factors affect the conclusions of standard Null Hypothesis Significance Testing (NHST) and of our novel method for comparison and ranking, the Chess Rating System for Evolutionary Algorithms (CRS4EAs). An extensive experiment was conducted in which the results of k=16 algorithms on N=40 optimisation problems over n=100 runs were gathered and saved. These results were then analysed to show how the three values affect the final results, how they affect the ranking, and which values yield unreliable results. The influence of the number of algorithms was examined for k={4, 8, 12, 16}, the number of problems for N={5, 10, 20, 40}, and the number of independent runs for n={10, 30, 50, 100}. We also compared the two methods (NHST's Friedman test with the post-hoc Nemenyi test, and CRS4EAs) to see whether either has advantages over the other. Whilst the conclusions for the different values of k were very similar, this research showed that a poorly chosen value of N can give unreliable results when analysing with the Friedman test: for small values of N the Friedman test detects no significant differences, or only a few, whereas CRS4EAs does not have this problem. We have also shown that CRS4EAs is an appropriate method when only a small number of independent runs n is available.
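To make the role of N concrete, the sketch below computes the textbook Friedman statistic from average ranks and the Nemenyi critical difference, CD = q_alpha * sqrt(k(k+1) / (6N)). It is an illustration of the standard formulas only, not the authors' experimental code: the function names are hypothetical, the value q_alpha = 2.569 is the commonly tabulated critical value for k=4 at alpha=0.05, and no data from the paper are used. Because CD shrinks as N grows, two algorithms need a larger gap in average rank to be declared different when few problems are used.

```python
import math

# Assumption: tabulated Nemenyi critical value for k = 4 algorithms at alpha = 0.05.
Q_ALPHA_K4 = 2.569


def average_ranks(scores):
    """Average rank of each algorithm over all problems.

    scores[i][j] is the result of algorithm j on problem i; lower is better
    (rank 1). Tied results share the mean of the ranks they occupy.
    """
    k = len(scores[0])
    totals = [0.0] * k
    for row in scores:
        order = sorted(range(k), key=lambda j: row[j])
        pos = 0
        while pos < k:
            end = pos
            while end + 1 < k and row[order[end + 1]] == row[order[pos]]:
                end += 1
            mean_rank = (pos + end) / 2 + 1
            for idx in order[pos:end + 1]:
                totals[idx] += mean_rank
            pos = end + 1
    return [t / len(scores) for t in totals]


def friedman_statistic(avg_ranks, n_problems):
    """Friedman chi-square statistic computed from average ranks."""
    k = len(avg_ranks)
    return 12 * n_problems / (k * (k + 1)) * (
        sum(r * r for r in avg_ranks) - k * (k + 1) ** 2 / 4
    )


def nemenyi_critical_difference(k, n_problems, q_alpha):
    """Smallest average-rank gap the Nemenyi test treats as significant."""
    return q_alpha * math.sqrt(k * (k + 1) / (6 * n_problems))


# Critical difference for k = 4 at the numbers of problems studied in the paper.
for n_problems in (5, 10, 20, 40):
    cd = nemenyi_critical_difference(4, n_problems, Q_ALPHA_K4)
    print(f"N={n_problems:2d}  CD={cd:.3f}")
```

With k=4 the largest possible gap between two average ranks is 3, so a critical difference near 2 at N=5 leaves little room for any pair to be flagged, which is consistent with the behaviour the abstract reports for small N.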


Journal: Applied Soft Computing
Publication Date: Jan 12, 2017
Citations: 46

Similar Papers

A chess rating system for evolutionary algorithms: A new method for the comparison and ranking of evolutionary algorithms
Niki Veček ... Matej Črepinšek
Information Sciences | VOL. 277
13 Mar 2014

A Comparison between Different Chess Rating Systems for Ranking Evolutionary Algorithms
Niki Veček ... Dejan Hrnčič
29 Sep 2014

Confidence curves: an alternative to null hypothesis significance testing for the comparison of classifiers
Daniel Berrar
Machine Learning | VOL. 106
30 Dec 2016

Bayesian alternatives to null-hypothesis significance testing for repeated-measures designs
Farouk S Nathoo ... Michael E.J Masson
Journal of Mathematical Psychology | VOL. 72
08 Apr 2015
