Two Challenges of Correct Validation in Pattern Recognition

Thomas Nowotny

doi:10.3389/frobt.2014.00005

Abstract

Supervised pattern recognition is the process of mapping patterns to class labels that define their meaning. The core methods for pattern recognition have been developed by machine learning experts but due to their broad success an increasing number of non-experts are now employing and refining them. In this perspective I will discuss the challenge of correct validation of supervised pattern recognition systems, in particular when employed by non-experts. To illustrate the problem I will give three examples of common errors that I have encountered in the last year. Much of this challenge can be addressed by strict procedure in validation but there are remaining problems of correctly interpreting comparative work on exemplary data sets, which I will elucidate on the example of the well-used MNIST data set of handwritten digits.

Highlights

Pattern recognition is the process of mapping input data, a pattern, to a label, the“class”to which the input pattern belongs
Among the common approaches to pattern recognition, supervised machine learning approaches have gained a lot of momentum with headline successes, e.g., on the MNIST data set of handwritten digits (LeCun and Cortes, 1998), which I will use for illustration later
It is on this background that I would like to highlight two aspects of validation and crossvalidation that do not seem to be fully appreciated in the larger community of applied machine learning practitioners: (i) the need for correct and strict procedure in validation or crossvalidation and (ii) the need for careful interpretation of validation results if multiple studies use the same reference data set

Summary

INTRODUCTION

Pattern recognition is the process of mapping input data, a pattern, to a label, the“class”to which the input pattern belongs. While there are still developments, e.g., in SVM technology (Huerta et al, 2012) and deep learning architectures, there is an increasing number of studies focusing on preprocessing of data, refining meta-parameters, and applying the established methods to novel real world applications This trend is driven by scientists who are not necessarily experts in machine learning but want to apply machine learning methods in their own application domain. It is on this background that I would like to highlight two aspects of validation and crossvalidation that do not seem to be fully appreciated in the larger community of applied machine learning practitioners: (i) the need for correct and strict procedure in validation or crossvalidation and (ii) the need for careful interpretation of validation results if multiple studies use the same reference data set. Both issues are different aspects of the same problem of overfitting and closely related, but while the former has known solutions that applied researchers can be informed about the latter is more involved and invites further research by machine learning experts

STRICT PROCEDURE OF VALIDATION IN INDIVIDUAL STUDIES

EXAMPLE 1

EXAMPLE 2

EXAMPLE 3

INTERPRETATION OF MULTIPLE STUDIES ON A COMMON DATA SET

Findings

DISCUSSION AND CONCLUSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Frontiers in Robotics and AI	Publication Date: Sep 25, 2014
Citations: 21	License type: cc-by

R Discovery Prime

R Discovery Prime

Two Challenges of Correct Validation in Pattern Recognition

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Robotics and AI

Lead the way for us

Similar Papers

A Practical Study on Feature Selection Methods in Pattern Recognition: Examples of Handwritten Digits and Printed Musical Notation
Wladyslaw Homenda ... Agnieszka Jastrzebska
-
Wladyslaw Homenda, et. al.Wladyslaw Homenda ... Agnieszka Jastrzebska
01 Jul 2017
01 Jul 2017

Tackling temporal pattern recognition by vector space embedding
Brian Iwana ... Kaspar Riesen
-
Brian Iwana, et. al.Brian Iwana ... Kaspar Riesen
01 Aug 2015
01 Aug 2015

Learning of classification tasks with an array of uniform-mode spin Hall nano-oscillators
Utkarsh Singh ... Neha Garg
AIP Advances | VOL. 11
Utkarsh Singh, et. al.Utkarsh Singh ... Neha Garg
01 Apr 2021
AIP Advances | VOL. 11

Deep Spatiotemporal Feature Learning with Application to Image Classification
Thomas P Karnowski ... Itamar Arel
-
Thomas P Karnowski, et. al.Thomas P Karnowski ... Itamar Arel
01 Dec 2010
01 Dec 2010

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Two Challenges of Correct Validation in Pattern Recognition

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Robotics and AI