Effective hyperparameter optimization using Nelder-Mead method in deep learning

Yoshihiko Ozaki, Masaki Onishi, Masaki Yano

doi:10.1186/s41074-017-0030-7

Abstract

In deep learning, deep neural network (DNN) hyperparameters can severely affect network performance. Currently, such hyperparameters are frequently optimized by several methods, such as Bayesian optimization and the covariance matrix adaptation evolution strategy. However, it is difficult for non-experts to employ these methods. In this paper, we adapted the simpler coordinate-search and Nelder-Mead methods to optimize hyperparameters. Several hyperparameter optimization methods were compared by configuring DNNs for character recognition and age/gender classification. Numerical results demonstrated that the Nelder-Mead method outperforms the other methods and achieves state-of-the-art accuracy for age/gender classification.

Highlights

The evolution of deep neural networks (DNNs) has dramatically improved the accuracy of character recognition [1], object recognition [2, 3], and other tasks
A hyperparameter optimization problem can be formulated as a stochastic black box optimization problem to minimize a noisy black box objective function f(x): minimize f(x) subject to x ∈ χ
The results demonstrated that Bayesian optimization outperforms manual search by a human expert and random search [7, 8]

Summary

Introduction

The evolution of deep neural networks (DNNs) has dramatically improved the accuracy of character recognition [1], object recognition [2, 3], and other tasks. However, their increasing complexity increases the number of hyperparameters, which makes hyperparameter tuning an intractable task. Because the search space expands exponentially with the number of hyperparameters, naive search methods no longer work well, and more sophisticated hyperparameter optimization methods are required. A hyperparameter optimization problem can be formulated as a stochastic black box optimization problem to minimize a noisy black box objective function f(x): minimize f(x) subject to x ∈ χ.
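To make the black box formulation concrete, the following is a minimal sketch of a textbook Nelder-Mead search applied to a noisy objective. It is not the authors' implementation: the function `noisy_loss`, its two inputs (log10 learning rate and dropout rate), the starting simplex, and the coefficient values are all assumptions chosen for illustration, and in practice f(x) would train a DNN and return a validation error.

```python
import random

def nelder_mead(f, simplex, max_evals=200, alpha=1.0, gamma=2.0, rho=0.5, sigma=0.5):
    """Minimize f from a starting simplex of n+1 points (lists of n floats).

    Returns (best_value, best_point). The evaluation budget is approximate:
    a shrink step may overshoot max_evals by a few calls.
    """
    n = len(simplex[0])
    scored = [(f(p), list(p)) for p in simplex]  # each vertex evaluated once
    evals = len(scored)
    while evals < max_evals:
        scored.sort(key=lambda t: t[0])
        best_val = scored[0][0]
        second_worst_val = scored[-2][0]
        worst_val, worst = scored[-1]
        # Centroid of every vertex except the worst one.
        centroid = [sum(p[i] for _, p in scored[:-1]) / n for i in range(n)]
        # Reflection of the worst vertex through the centroid.
        xr = [c + alpha * (c - w) for c, w in zip(centroid, worst)]
        fr = f(xr)
        evals += 1
        if best_val <= fr < second_worst_val:
            scored[-1] = (fr, xr)
        elif fr < best_val:
            # Expansion: try going further in the same direction.
            xe = [c + gamma * (r - c) for c, r in zip(centroid, xr)]
            fe = f(xe)
            evals += 1
            scored[-1] = (fe, xe) if fe < fr else (fr, xr)
        else:
            # Contraction: outside if the reflection beat the worst, else inside.
            target = xr if fr < worst_val else worst
            xc = [c + rho * (t - c) for c, t in zip(centroid, target)]
            fc = f(xc)
            evals += 1
            if fc < min(fr, worst_val):
                scored[-1] = (fc, xc)
            else:
                # Shrink every vertex toward the best one and re-evaluate.
                best = scored[0][1]
                shrunk = [scored[0]]
                for _, p in scored[1:]:
                    q = [b + sigma * (pi - b) for b, pi in zip(best, p)]
                    shrunk.append((f(q), q))
                    evals += 1
                scored = shrunk
    return min(scored, key=lambda t: t[0])

# Hypothetical stand-in for "train a DNN and return validation error".
rng = random.Random(0)

def noisy_loss(x):
    log_lr, dropout = x
    return (log_lr + 3.0) ** 2 + 4.0 * (dropout - 0.5) ** 2 + rng.gauss(0.0, 0.01)

best_val, best_x = nelder_mead(noisy_loss, [[-1.0, 0.1], [-2.0, 0.1], [-1.0, 0.3]], max_evals=60)
print("best noisy loss:", best_val, "at", best_x)
```

Because each stored vertex value is a single noisy observation, this sketch can be misled by a lucky evaluation. It also ignores bounds on χ, so a real search would need to clip or reject points outside the valid hyperparameter ranges.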

Methods

Results

Conclusion


Journal: IPSJ Transactions on Computer Vision and Applications
Publication Date: Nov 10, 2017
Citations: 44
License type: open-access

Similar Papers

Towards an efficient validation of dynamical whole-brain models
Kevin J Wischnewski ... Oleksandr V Popovych
Scientific Reports | VOL. 12
14 Mar 2022

Deep learning algorithm for data classification with hyperparameter optimization method
T Badriyah ... D B Santoso
Journal of Physics: Conference Series | VOL. 1193
01 Apr 2019

Comprehensive Study for Breast Cancer Using Deep Learning and Traditional Machine Learning
ZANCO JOURNAL OF PURE AND APPLIED SCIENCES | VOL. 34
12 Apr 2022

Deep Learning and Machine Learning with Grid Search to Predict Later Occurrence of Breast Cancer Metastasis Using Clinical Data.
Xia Jiang ... Chuhan Xu
Journal of Clinical Medicine | VOL. 11
29 Sep 2022
