Abstract
Gaussian processes (Kriging) are interpolating data-driven models that are frequently applied in various disciplines. Often, Gaussian processes are trained on datasets and are subsequently embedded as surrogate models in optimization problems. These optimization problems are nonconvex, and global optimization is desired. However, previous literature observed computational burdens limiting deterministic global optimization to Gaussian processes trained on few data points. We propose a reduced-space formulation for deterministic global optimization with trained Gaussian processes embedded. For optimization, the branch-and-bound solver branches only on the free variables, and McCormick relaxations are propagated through explicit Gaussian process models. The approach also leads to significantly smaller and computationally cheaper subproblems for lower and upper bounding. To further accelerate convergence, we derive envelopes of common covariance functions for GPs and tight relaxations of acquisition functions used in Bayesian optimization, including expected improvement, probability of improvement, and lower confidence bound. In total, we reduce computational time by orders of magnitude compared to state-of-the-art methods, thus overcoming previous computational burdens. We demonstrate the performance and scaling of the proposed method and apply it to Bayesian optimization with global optimization of the acquisition function and to chance-constrained programming. The Gaussian process models, acquisition functions, and training scripts are available open-source within the "MeLOn - Machine Learning Models for Optimization" toolbox (https://git.rwth-aachen.de/avt.svt/public/MeLOn).
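For reference, the quantity that such a surrogate embeds is the standard GP posterior: with training inputs X, targets y, covariance function k, and noise variance sn2, the predictive mean is m(x) = k(x, X) (K + sn2*I)^-1 y and the predictive variance is s2(x) = k(x, x) - k(x, X) (K + sn2*I)^-1 k(X, x). The following is a minimal NumPy sketch of these textbook equations; the squared-exponential covariance and all hyperparameter values are illustrative assumptions, not part of the MeLOn API.

```python
import numpy as np

def se_kernel(A, B, ell=1.0, sf2=1.0):
    """Squared-exponential covariance k(a, b) = sf2 * exp(-||a - b||^2 / (2 ell^2))."""
    d2 = np.sum(A**2, 1)[:, None] + np.sum(B**2, 1)[None, :] - 2 * A @ B.T
    return sf2 * np.exp(-0.5 * d2 / ell**2)

def gp_posterior(x, X, y, ell=1.0, sf2=1.0, sn2=1e-6):
    """Posterior mean and variance of a zero-mean GP at query points x."""
    K = se_kernel(X, X, ell, sf2) + sn2 * np.eye(len(X))
    L = np.linalg.cholesky(K)                       # K = L L^T
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))  # alpha = K^{-1} y
    ks = se_kernel(x, X, ell, sf2)                  # cross-covariances k(x, X)
    mean = ks @ alpha                               # m(x) = k(x, X) K^{-1} y
    v = np.linalg.solve(L, ks.T)
    var = sf2 - np.sum(v**2, axis=0)                # s2(x) = k(x,x) - k(x,X) K^{-1} k(X,x)
    return mean, var

# Toy usage: three 1-D training points, one query point.
X = np.array([[0.0], [0.5], [1.0]])
y = np.sin(2 * np.pi * X[:, 0])
m, s2 = gp_posterior(np.array([[0.25]]), X, y)
```

The Cholesky factorization is used here, as is standard, to avoid forming K^-1 explicitly.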
Highlights
A Gaussian process (GP) is a stochastic process where any finite collection of random variables has a multivariate Gaussian distribution; GPs can be understood as an infinite-dimensional generalization of multivariate Gaussian distributions [66].
GPs originate from geostatistics [46] and have gained popularity for the design and analysis of computer experiments (DACE) since 1989 [68].
Expected improvement (EI) is the acquisition function most commonly used in Bayesian optimization [40]; its closed form is recalled in the sketch below.
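Since the paper derives tight relaxations of these acquisition functions, their closed forms are worth recalling. For minimization with incumbent objective value f*, posterior mean mu(x), and posterior standard deviation sigma(x): EI(x) = (f* - mu) * Phi(z) + sigma * phi(z) with z = (f* - mu) / sigma, PI(x) = Phi(z), and LCB(x) = mu - kappa * sigma. A minimal SciPy sketch follows; the function names and the default kappa are illustrative assumptions.

```python
import numpy as np
from scipy.stats import norm

def expected_improvement(mu, sigma, f_best):
    # EI for minimization: (f* - mu) * Phi(z) + sigma * phi(z), z = (f* - mu) / sigma
    z = (f_best - mu) / sigma
    return (f_best - mu) * norm.cdf(z) + sigma * norm.pdf(z)

def probability_of_improvement(mu, sigma, f_best):
    # PI for minimization: Phi((f* - mu) / sigma)
    return norm.cdf((f_best - mu) / sigma)

def lower_confidence_bound(mu, sigma, kappa=2.0):
    # LCB: mu - kappa * sigma; minimizing it trades off exploitation and exploration
    return mu - kappa * sigma
```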
Summary
In practice, optimization problems with trained GPs embedded are frequently solved by local methods. These local methods have the drawback that they can lead to suboptimal solutions, because the resulting optimization problems are nonconvex. Previous work solves the problem globally using BARON in GAMS by providing the full set of GP equations as equality constraints. This leads to additional intermediate optimization variables besides the degrees of freedom of the problem. We refer to the problem formulation where the GP is described by equality constraints and additional optimization variables as a full-space (FS) formulation. It is commonly used in modeling environments, e.g., GAMS, that interface with state-of-the-art global solvers such as ANTIGONE [57], BARON [79], and SCIP [50]. Note that the MeLOn toolbox is automatically included as a submodule in our new MAiNGO release.
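The contrast with the reduced-space (RS) formulation can be sketched conceptually: in RS, the GP prediction stays an explicit function of the degrees of freedom x, so the branch-and-bound solver branches only on dim(x) variables and McCormick relaxations propagate through the function graph, whereas in FS every intermediate quantity becomes an optimization variable coupled by an equality constraint. The sketch below reuses the hypothetical gp_posterior helper from above; it is conceptual, not the paper's MAiNGO or GAMS model.

```python
import numpy as np

# Reduced-space (RS): the objective is the explicit composition x -> m(x).
# A branch-and-bound solver branches only on the d degrees of freedom x, and
# McCormick relaxations are propagated through this function graph.
def objective_rs(x, X, y):
    mean, _ = gp_posterior(np.atleast_2d(x), X, y)  # hypothetical helper from above
    return mean[0]

# Full-space (FS), conceptually: the same model written as equality constraints
# with intermediate variables, e.g., for n training points
#   z_i = k(x, X_i),  i = 1..n      (n extra variables and constraints)
#   m   = sum_i alpha_i * z_i       (1 extra variable and constraint)
# so the solver handles d + n + 1 variables instead of d alone.
```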