Multiagent Reinforcement Learning for Hyperparameter Optimization of Convolutional Neural Networks

Arman Iranfar,Marina Zapater,David Atienza

doi:10.1109/tcad.2021.3077193

Abstract

Nowadays, deep convolutional neural networks (DCNNs) play a significant role in many application domains, such as computer vision, medical imaging, and image processing. Nonetheless, designing a DCNN, able to defeat the state of the art, is a manual, challenging, and time-consuming task, due to the extremely large design space, as a consequence of a large number of layers and their corresponding hyperparameters. In this work, we address the challenge of performing hyperparameter optimization of DCNNs through a novel multiagent reinforcement learning (MARL)-based approach, eliminating the human effort. In particular, we adapt <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$Q$ </tex-math></inline-formula> -learning and define learning agents per layer to split the design space into independent smaller design subspaces such that each agent fine tunes the hyperparameters of the assigned layer concerning a global reward. Moreover, we provide a novel formation of <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$Q$ </tex-math></inline-formula> -tables along with a new update rule that facilitates agents’ communication. Our MARL-based approach is data driven and able to consider an arbitrary set of design objectives and constraints. We apply our MARL-based solution to different well-known DCNNs, including GoogLeNet, VGG, and U-Net, and various datasets for image classification and semantic segmentation. Our results have shown that compared to the original CNNs, the MARL-based approach can reduce the model size, training time, and inference time by up to, respectively, <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$83\times $ </tex-math></inline-formula> , 52%, and 54% without any degradation in accuracy. Moreover, our approach is very competitive to state-of-the-art neural architecture search methods in terms of the designed CNN accuracy and its number of parameters while significantly reducing the optimization cost.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multiagent Reinforcement Learning for Hyperparameter Optimization of Convolutional Neural Networks

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

Lead the way for us

Journal: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems	Publication Date: May 3, 2021
Citations: 10

Similar Papers

A multi-perspective revisit to the optimization methods of Neural Architecture Search and Hyper-parameter optimization for non-federated and federated learning environments
Salabat Khan ... Do Hyuen Kim
Computers & Electrical Engineering | VOL. 110
Salabat Khan, et. al.Salabat Khan ... Do Hyuen Kim
20 Jul 2023
Computers & Electrical Engineering | VOL. 110

Simple-Encoded evolving convolutional neural network and its application to skin disease image classification
Xiaoyu He ... Xiaojing Wang
Swarm and Evolutionary Computation | VOL. 67
Xiaoyu He, et. al.Xiaoyu He ... Xiaojing Wang
01 Dec 2021
Swarm and Evolutionary Computation | VOL. 67

Evolutionary Neural Architecture Search Supporting Approximate Multipliers
Michal Pinos ... Lukas Sekanina
-
Michal Pinos, et. al.Michal Pinos ... Lukas Sekanina
01 Jan 2020
01 Jan 2020

A Multiobjective Genetic Algorithm to Evolving Local Interpretable Model-Agnostic Explanations for Deep Neural Networks in Image Classification
Bin Wang ... Bing Xue
IEEE transactions on evolutionary computation : a publication of the IEEE Neural Networks Council | VOL. 28
Bin Wang, et. al.Bin Wang ... Bing Xue
01 Aug 2024
IEEE transactions on evolutionary computation : a publication of the IEEE Neural Networks Council | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multiagent Reinforcement Learning for Hyperparameter Optimization of Convolutional Neural Networks

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems