An Actor-Critic Method for Simulation-Based Optimization

Kuo Li, Qing-Shan Jia, Jiaqi Yan

doi:10.1016/j.ifacol.2022.08.040

Abstract

In this work, we study simulation-based optimization, where the agent aims to select the best configuration from the design space in as few iterations as possible. Inspired by the success of deep reinforcement learning (DRL), we formulate the sampling process as a policy search problem and solve it from the perspective of policy iteration. Concretely, we use a surrogate model that predicts the performance of each configuration and a parameterized sampling policy, which correspond to the critic and the actor in the actor-critic (AC) method, respectively. We further derive the update rule and propose two algorithms for configuration selection, one for continuous and one for discrete design spaces. Finally, the algorithms are validated experimentally on 1) two toy examples that illustrate the principle and 2) two high-dimensional tasks that demonstrate effectiveness on large-scale problems. The results show that the proposed algorithms handle large-scale problems efficiently and effectively eliminate sub-optimal configurations.
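The actor/critic split described in the abstract can be illustrated with a small sketch for a discrete design space. This is not the paper's algorithm or update rule, which the abstract does not spell out. It is a minimal illustration under stated assumptions: the critic is a per-configuration running mean of noisy simulation outputs (standing in for the surrogate model), the actor is a softmax sampling policy over configurations, higher performance is better, and the names `actor_critic_discrete`, `simulate`, `lr_actor`, and `true_means` are all hypothetical.

```python
import numpy as np


def actor_critic_discrete(simulate, n_configs, n_iters=3000, lr_actor=0.2, seed=0):
    """Toy actor-critic loop for choosing among n_configs discrete configurations.

    Assumptions (not from the paper): tabular critic, softmax actor,
    and `simulate(k, rng)` returns a noisy performance to be maximized.
    """
    rng = np.random.default_rng(seed)
    logits = np.zeros(n_configs)   # actor parameters (softmax sampling policy)
    q_hat = np.zeros(n_configs)    # critic: estimated performance per configuration
    counts = np.zeros(n_configs)   # number of simulations run per configuration
    probs = np.full(n_configs, 1.0 / n_configs)

    for _ in range(n_iters):
        # Actor: turn logits into sampling probabilities (numerically stable softmax).
        probs = np.exp(logits - logits.max())
        probs /= probs.sum()

        # Sample one configuration and run one noisy simulation of it.
        k = rng.choice(n_configs, p=probs)
        y = simulate(k, rng)

        # Critic update: incremental running mean of observed performance.
        counts[k] += 1
        q_hat[k] += (y - q_hat[k]) / counts[k]

        # Actor update: gradient of sum_k probs[k] * q_hat[k] with respect to
        # the logits, which for a softmax is probs * (q_hat - expected value).
        baseline = probs @ q_hat
        logits += lr_actor * probs * (q_hat - baseline)

    return probs, q_hat


# Hypothetical toy problem: four configurations with made-up mean performances.
true_means = np.array([0.2, 0.5, 0.9, 0.4])
probs, q_hat = actor_critic_discrete(
    lambda k, rng: true_means[k] + 0.05 * rng.standard_normal(),
    n_configs=len(true_means),
)
best = int(np.argmax(probs))  # configuration the policy concentrates on
```

As the policy concentrates on configurations the critic rates highly, poorly performing configurations are sampled less and less often, which is one simple way to read the abstract's claim about eliminating sub-optimal configurations. A tabular critic does not scale to large design spaces; the paper's surrogate model is what addresses that case.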


Journal: IFAC-PapersOnLine
Publication Date: Jan 1, 2022
Citations: 1
