Abstract
Physical data are typically generated by experiments and computations in limited parameter regimes. When datasets generated using such disparate methods are combined into one dataset, the resulting dataset is typically sparse, with dense "islands" in a potentially high-dimensional parameter space, and predictions must be interpolated among such islands. Using plasma transport data as our example, we propose a multifidelity Gaussian-process regression framework that incorporates physical data from multiple sources at multiple fidelities. The impact of the proposed framework varies from little improvement over simpler approaches to qualitatively changing the prediction with consistently increased confidence in regions lacking high-fidelity data. By varying low- and high-fidelity data sources, we demonstrate an approach for determining when multifidelity Gaussian-process regression adds value over single-fidelity regression and therefore when its additional computational costs are merited. We also examine the case in which the outputs of the low- and high-fidelity models correspond to different physical quantities, one of which may be intrinsically computationally cheaper to compute. We conclude by analyzing strategies for sampling high-fidelity data for use in this framework, and we develop a simple sampling approach for reducing regression error across large gaps in data.
Highlights
The generation of high-fidelity (HF) data requires substantial resources that limit the volume of data that can be generated
We have investigated the use of MF-Gaussian-process regression (GPR) to interpolate plasma transport data over a wide parameter space in which HF data are available in localized patches
We have examined the improvements in both the predicted mean and the predicted uncertainty that MF-GPR provides over GPR
Summary
The generation of high-fidelity (HF) data requires substantial resources that limit the volume of data that can be generated. We will examine the situation in which there are islands of HF data in parameter space, possibly from different sources, and we will fill the space between these islands with easier-to-compute LF data Such an approach utilizes multifidelity (MF) extensions [11] of GPR.
Full Text
Topics from this Paper
Large Gaps In Data
Multifidelity Gaussian-process Regression
High-fidelity Data
Multiple Fidelities
Gaps In Data
+ Show 5 more
Create a personalized feed of these topics
Get StartedTalk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Similar Papers
Applied Energy
Sep 1, 2021
Studia Geophysica et Geodaetica
Jul 1, 2019
Journal of Geodesy
Oct 18, 2022
Computer Methods in Applied Mechanics and Engineering
Mar 1, 2022
Environmental Chemistry
Jan 1, 2023
Ecology and evolution
Sep 1, 2023
Nov 27, 2018
The Astronomical Journal
Oct 14, 2022
Social Science Research Network
Jun 26, 2014
Canadian Water Resources Journal / Revue canadienne des ressources hydriques
Jan 1, 2005
Global Ecology and Conservation
Jun 1, 2021
Sep 1, 2011
Journal of Geophysics and Engineering
Aug 1, 2020
Environmental Health Perspectives
Apr 1, 2005
Physical Review E
Physical Review E
Nov 27, 2023
Physical Review E
Nov 27, 2023
Physical Review E
Nov 27, 2023
Physical Review E
Nov 27, 2023
Physical Review E
Nov 27, 2023
Physical Review E
Nov 27, 2023
Physical Review E
Nov 27, 2023
Physical Review E
Nov 27, 2023
Physical Review E
Nov 27, 2023
Physical Review E
Nov 27, 2023