Abstract

Physical data are typically generated by experiments and computations in limited parameter regimes. When datasets generated using such disparate methods are combined into one dataset, the resulting dataset is typically sparse, with dense "islands" in a potentially high-dimensional parameter space, and predictions must be interpolated among such islands. Using plasma transport data as our example, we propose a multifidelity Gaussian-process regression framework that incorporates physical data from multiple sources at multiple fidelities. The impact of the proposed framework varies from little improvement over simpler approaches to qualitatively changing the prediction with consistently increased confidence in regions lacking high-fidelity data. By varying low- and high-fidelity data sources, we demonstrate an approach for determining when multifidelity Gaussian-process regression adds value over single-fidelity regression and therefore when its additional computational costs are merited. We also examine the case in which the outputs of the low- and high-fidelity models correspond to different physical quantities, one of which may be intrinsically computationally cheaper to compute. We conclude by analyzing strategies for sampling high-fidelity data for use in this framework, and we develop a simple sampling approach for reducing regression error across large gaps in data.

Highlights

  • The generation of high-fidelity (HF) data requires substantial resources that limit the volume of data that can be generated

  • We have investigated the use of MF-Gaussian-process regression (GPR) to interpolate plasma transport data over a wide parameter space in which HF data are available in localized patches

  • We have examined the improvements in both the predicted mean and the predicted uncertainty that MF-GPR provides over GPR

Read more

Summary

INTRODUCTION

The generation of high-fidelity (HF) data requires substantial resources that limit the volume of data that can be generated. We will examine the situation in which there are islands of HF data in parameter space, possibly from different sources, and we will fill the space between these islands with easier-to-compute LF data Such an approach utilizes multifidelity (MF) extensions [11] of GPR.

DATASET AND REGRESSION METHODS
Ionic transport-coefficient dataset
Single-fidelity regression
Multifidelity Gaussian-process regression
Error calculations and computation cost
MULTIFIDELITY REGRESSION OF PLASMA TRANSPORT-COEFFICIENT DATA
REGRESSION OF SPARSE DISPARATE DATA
Self-diffusion and viscosity predictions versus temperature
Sampling methods for HF data
Regression error
CONCLUSIONS AND OUTLOOK

Full Text

Published Version
Open DOI Link

Get access to 115M+ research papers

Discover from 40M+ Open access, 2M+ Pre-prints, 9.5M Topics and 32K+ Journals.

Sign Up Now! It's FREE

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call