Efficient Gradient Computation Research Articles

Surrogate models provide an affordable alternative to the evaluation of expensive deterministic functions. However, the construction of accurate surrogate models with many independent variables is currently prohibitive because they require a large number of function evaluations for the desired accuracy. Gradient-enhanced kriging has the potential to reduce the number of evaluations when efficient gradient computation, such as an adjoint method, is available. However, current gradient-enhanced kriging methods do not scale well with the number of sampling points because of the rapid growth in the size of the correlation matrix, where new information is added for each sampling point in each direction of the design space. Furthermore, they do not scale well with the number of independent variables because of the increase in the number of hyperparameters that must be estimated. To address this issue, we develop a new gradient-enhanced surrogate model approach that drastically reduces the number of hyperparameters through the use of the partial least squares method to maintain accuracy. In addition, this method is able to control the size of the correlation matrix by adding only relevant points defined by the information provided by the partial least squares method. To validate our method, we compare the global accuracy of the proposed method with conventional kriging surrogate models on two analytic functions with up to 100 dimensions, as well as engineering problems of varied complexity with up to 15 dimensions. We show that the proposed method requires fewer sampling points than conventional methods to obtain the desired accuracy, or it provides more accuracy for a fixed budget of sampling points. In some cases, we get models that are over three times more accurate than previously developed surrogate models for the same computational time, and over 3200 times faster than standard gradient-enhanced kriging models for the same accuracy.

Read full abstract

Many scientific problems such as classifier training or medical image reconstruction can be expressed as minimization of differentiable real-valued cost functions and solved with iterative gradient-based methods. Adjoint algorithmic differentiation (AAD) enables automated computation of gradients of such cost functions implemented as computer programs. To backpropagate adjoint derivatives, excessive memory is potentially required to store the intermediate partial derivatives on a dedicated data structure, referred to as the “tape”. Parallelization is difficult because threads need to synchronize their accesses during taping and backpropagation. This situation is aggravated for many-core architectures, such as Graphics Processing Units (GPUs), because of the large number of light-weight threads and the limited memory size in general as well as per thread. We show how these limitations can be mediated if the cost function is expressed using GPU-accelerated vector and matrix operations which are recognized as intrinsic functions by our AAD software. We compare this approach with naive and vectorized implementations for CPUs. We use four increasingly complex cost functions to evaluate the performance with respect to memory consumption and gradient computation times. Using vectorization, CPU and GPU memory consumption could be substantially reduced compared to the naive reference implementation, in some cases even by an order of complexity. The vectorization allowed usage of optimized parallel libraries during forward and reverse passes which resulted in high speedups for the vectorized CPU version compared to the naive reference implementation. The GPU version achieved an additional speedup of 7.5±4.4, showing that the processing power of GPUs can be utilized for AAD using this concept. Furthermore, we show how this software can be systematically extended for more complex problems such as nonlinear absorption reconstruction for fluorescence-mediated tomography. Program summaryProgram title: AD-GPUCatalogue identifier: AEYX_v1_0Program summary URL: http://cpc.cs.qub.ac.uk/summaries/AEYX_v1_0.htmlProgram obtainable from: CPC Program Library, Queen’s University, Belfast, N. IrelandLicensing provisions: Standard CPC licence, http://cpc.cs.qub.ac.uk/licence/licence.htmlNo. of lines in distributed program, including test data, etc.: 16715No. of bytes in distributed program, including test data, etc.: 143683Distribution format: tar.gzProgramming language: C++ and CUDA.Computer: Any computer with a compatible C++ compiler and a GPU with CUDA capability 3.0 or higher.Operating system: Windows 7 or Linux.RAM: 16 GbyteClassification: 4.9, 4.12, 6.1, 6.5.External routines: CUDA 6.5, Intel MKL (optional) and routines from BLAS, LAPACK and CUBLASNature of problem: Gradients are required for many optimization problems, e.g. classifier training or nonlinear image reconstruction. Often, the function, of which the gradient is required, can be implemented as a computer program. Then, algorithmic differentiation methods can be used to compute the gradient. Depending on the approach this may result in excessive requirements of computational resources, i.e. memory and arithmetic computations. GPUs provide massive computational resources but require special considerations to distribute the workload onto many light-weight threads.Solution method: Adjoint algorithmic differentiation allows efficient computation of gradients of cost functions given as computer programs. The gradient can be theoretically computed using a similar amount of arithmetic operations as one function evaluation. Optimal usage of parallel processors and limited memory is a major challenge which can be mediated by the use of vectorization.Restrictions: To use the GPU-accelerated adjoint algorithmic differentiation method, the cost function must be implemented using the provided AD-GPU intrinsics for matrix and vector operations. Unusual features:GPU-acceleration.Additional comments: The code uses some features of C++11, e.g. std::shared ptr. Alternatively, the boost library can be used.Running time: The time to run the example program is a few minutes or up to a few hours to reproduce the performance measurements.

Read full abstract

Efficient Gradient Computation Research Articles

Related Topics

Articles published on Efficient Gradient Computation

Identification of a nonlinear spring and damper characteristics of a motorcycle suspension using test ride data

OCEAN: An On-Chip Incremental-Learning Enhanced Artificial Neural Network Processor With Multiple Gated-Recurrent-Unit Accelerators

Gradient-based optimization for regression in the functional tensor-train format

An adjoint method for gradient-based optimization of stellarator coil shapes

Gradient-enhanced kriging for high-dimensional problems

Scalable Backpropagation for Gaussian Processes using Celerite

Multiscale gradient computation for flow in heterogeneous porous media

Gradient-Based Optimization for Poroelastic and Viscoelastic MR Elastography

GPU-accelerated adjoint algorithmic differentiation

Parameters affecting laterally loaded piles in frozen soils by an efficient sensitivity analysis method

SAGRAD: A Program for Neural Network Training with Simulated Annealing and the Conjugate Gradient Method.

Reservoir management optimization using well-specific upscaling and control switching

Stochastic reduced order models for inverse problems under uncertainty

Utilization of efficient gradient and Hessian computations in the force field optimization process of molecular simulations

Improvement of state profile accuracy in nonlinear dynamic optimization with the quasi‐sequential approach

Highly Efficient Gradient Computation for Density-Constrained Analytical Placement

New advances in three-dimensional controlled-source electromagnetic inversion

Response Gradients for Nonlinear Beam-Column Elements under Large Displacements

Optimization of nonlinear wave function parameters

On Efficient Computation of the Optimization Problem Arising in the Inverse Modeling of Non-Stationary Multiphase Multicomponent Flow Through Porous Media

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Efficient Gradient Computation Research Articles

Related Topics

Articles published on Efficient Gradient Computation

Identification of a nonlinear spring and damper characteristics of a motorcycle suspension using test ride data

OCEAN: An On-Chip Incremental-Learning Enhanced Artificial Neural Network Processor With Multiple Gated-Recurrent-Unit Accelerators

Gradient-based optimization for regression in the functional tensor-train format

An adjoint method for gradient-based optimization of stellarator coil shapes

Gradient-enhanced kriging for high-dimensional problems

Scalable Backpropagation for Gaussian Processes using Celerite

Multiscale gradient computation for flow in heterogeneous porous media

Gradient-Based Optimization for Poroelastic and Viscoelastic MR Elastography

GPU-accelerated adjoint algorithmic differentiation

Parameters affecting laterally loaded piles in frozen soils by an efficient sensitivity analysis method

SAGRAD: A Program for Neural Network Training with Simulated Annealing and the Conjugate Gradient Method.

Reservoir management optimization using well-specific upscaling and control switching

Stochastic reduced order models for inverse problems under uncertainty

Utilization of efficient gradient and Hessian computations in the force field optimization process of molecular simulations

Improvement of state profile accuracy in nonlinear dynamic optimization with the quasi‐sequential approach

Highly Efficient Gradient Computation for Density-Constrained Analytical Placement

New advances in three-dimensional controlled-source electromagnetic inversion

Response Gradients for Nonlinear Beam-Column Elements under Large Displacements

Optimization of nonlinear wave function parameters

On Efficient Computation of the Optimization Problem Arising in the Inverse Modeling of Non-Stationary Multiphase Multicomponent Flow Through Porous Media