Graphics Processing Units Processing Research Articles

Abstract. Ice-sheet flow models capable of accurately projecting their future mass balance constitute tools to improve flood risk assessment and assist sea-level rise mitigation associated with enhanced ice discharge. Some processes that need to be captured, such as grounding-line migration, require high spatial resolution (under the kilometer scale). Conventional ice flow models mainly execute on central processing units (CPUs), which feature limited parallel processing capabilities and peak memory bandwidth. This may hinder model scalability and result in long run times, requiring significant computational resources. As an alternative, graphics processing units (GPUs) are ideally suited for high spatial resolution, as the calculations can be performed concurrently by thousands of threads, processing most of the computational domain simultaneously. In this study, we combine a GPU-based approach with the pseudo-transient (PT) method, an accelerated iterative and matrix-free solution strategy, and investigate its performance for finite elements and unstructured meshes with application to two-dimensional (2-D) models of real glaciers at a regional scale. For both the Jakobshavn and Pine Island glacier models, the number of nonlinear PT iterations required to converge a given number of vertices (N) scales in the order of 𝒪(N1.2) or better. We further compare the performance of the PT CUDA C implementation with a standard finite-element CPU-based implementation using the price-to-performance metric. The price of a single Tesla V100 GPU is 1.5 times that of two Intel Xeon Gold 6140 CPUs. We expect a minimum speedup of at least 1.5 times to justify the Tesla V100 GPU price to performance. Our developments result in a GPU-based implementation that achieves this goal with a speedup beyond 1.5 times. This study represents a first step toward leveraging GPU processing power, enabling more accurate polar ice discharge predictions. The insights gained will benefit efforts to diminish spatial resolution constraints at higher computing performance. The higher computing performance will allow for ensembles of ice-sheet flow simulations to be run at the continental scale and higher resolution, a previously challenging task. The advances will further enable the quantification of model sensitivity to changes in upcoming climate forcings. These findings will significantly benefit process-oriented sea-level-projection studies over the coming decades.

In the field of quantum mechanics, the theoretical study of the interaction between intense laser field and atoms and molecules depends very much on the numerical solution of the time-dependent Schrödinger equation. However, solving the three-dimensional time-dependent Schrödinger equation is not a simple task, and the analytical solution cannot be obtained, so it can only be solved numerically with the help of computer. In order to shorten the computing time and obtain the results quickly, it is necessary to use parallel methods to speed up computing. In this paper, under the background of strong field ionization, the three-dimensional time-dependent Schrödinger equation of hydrogen atom is solved in parallel, and the suprathreshold ionization of hydrogen atom under the action of linearly polarized infrared laser electric field is taken for example. Based on the spherical polar coordinate system, the time-dependent Schrödinger equation is discretized by the splitting operator-Fourier transform method, and the photoelectron continuous state wave function under the length gauge can be obtained. In Graphics processing unit (GPU) accelerated applications, the sequential portion of the workload runs on central processing unit (CPU) (which is optimized for single-threaded performance), while the compute-intensive part of the application runs in parallel on thousands of GPU cores. The GPU can make full use of the advantage of fine-grained parallelism based on multi-thread structure to realize parallel acceleration of the whole algorithm. Two accelerated computing modes of CPU parallel and GPU parallel are adopted, and their parallel acceleration performance is discussed. Compared with the results from the existing physical laws, the calculation error is also within an acceptable range, and the result is also consistent with the result from the existing physical laws of suprathreshold ionization, which also verifies the correctness of the program. In order to obtain a relatively accurate acceleration ratio, many different experiments are carried out. Computational experiments show that under the condition of ensuring accuracy, the GPU parallel computing speeds by up to about 60 times maximally based on the computational performance of CPU. It can be seen that the accelerated numerical solution of three-dimensional time-dependent Schrödinger equation based on GPU can significantly shorten the computational time. This work has important guiding significance for rapidly solving the three-dimensional time-dependent Schrödinger equation by using GPU.

Graphics Processing Units Processing Research Articles

Related Topics

Articles published on Graphics Processing Units Processing

An Implementation of LASER Beam Welding Simulation on Graphics Processing Unit Using CUDA

Graphics-processing-unit-accelerated ice flow solver for unstructured meshes using the Shallow-Shelf Approximation (FastIceFlo v1.0.1)

GPU acceleration of conjugate gradient method obtaining Green's function for transport-property calculation

A Heterogeneous Parallel Algorithm for Euler-Lagrange Simulations of Liquid in Supersonic Flow

The implementation of the three-dimensional unified gas-kinetic wave-particle method on multiple graphics processing units

A Hybrid GPU and CPU Parallel Computing Method to Accelerate Millimeter-Wave Imaging

Implementation of Beeman's algorithm to calculate execution time on GPU using CUDA

Sentence-Level Sentiment Classification A Comparative Study Between Deep Learning Models

Analysis of Ionicity-Magnetism Competition in 2D-MX3 Halides towards a Low-Dimensional Materials Study Based on GPU-Enabled Computational Systems.

A graphics processing unit-based robust numerical model for solute transport driven by torrential flow condition

Comparative study of the implementation of the Lagrange interpolation algorithm on GPU and CPU using CUDA to compute the density of a material at different temperatures

GPU simulation with Opticks: The future of optical simulations for LZ

Graph Reachability on Parallel Many-Core Architectures

Numerical solution of three-dimensional time-dependent Schrödinger equation based on graphic processing unit acceleration

A Performance Study of Moving Particle Semi-Implicit Method for Incompressible Fluid Flow on GPU

NnAudio: An on-the-Fly GPU Audio to Spectrogram Conversion Toolbox Using 1D Convolutional Neural Networks

Improving classification and clustering techniques using GPUs

Accelerated organ region segmentation by the revised radial basis function network using a graphics processing unit.

Layer-based visualization and biomedical information exploration of multi-channel large histological data

Live Ultrasound Color-Encoded Speckle Imaging Platform for Real-Time Complex Flow Visualization In Vivo.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Graphics Processing Units Processing Research Articles

Related Topics

Articles published on Graphics Processing Units Processing

An Implementation of LASER Beam Welding Simulation on Graphics Processing Unit Using CUDA

Graphics-processing-unit-accelerated ice flow solver for unstructured meshes using the Shallow-Shelf Approximation (FastIceFlo v1.0.1)

GPU acceleration of conjugate gradient method obtaining Green's function for transport-property calculation

A Heterogeneous Parallel Algorithm for Euler-Lagrange Simulations of Liquid in Supersonic Flow

The implementation of the three-dimensional unified gas-kinetic wave-particle method on multiple graphics processing units

A Hybrid GPU and CPU Parallel Computing Method to Accelerate Millimeter-Wave Imaging

Implementation of Beeman's algorithm to calculate execution time on GPU using CUDA

Sentence-Level Sentiment Classification A Comparative Study Between Deep Learning Models

Analysis of Ionicity-Magnetism Competition in 2D-MX3 Halides towards a Low-Dimensional Materials Study Based on GPU-Enabled Computational Systems.

A graphics processing unit-based robust numerical model for solute transport driven by torrential flow condition

Comparative study of the implementation of the Lagrange interpolation algorithm on GPU and CPU using CUDA to compute the density of a material at different temperatures

GPU simulation with Opticks: The future of optical simulations for LZ

Graph Reachability on Parallel Many-Core Architectures

Numerical solution of three-dimensional time-dependent Schrödinger equation based on graphic processing unit acceleration

A Performance Study of Moving Particle Semi-Implicit Method for Incompressible Fluid Flow on GPU

NnAudio: An on-the-Fly GPU Audio to Spectrogram Conversion Toolbox Using 1D Convolutional Neural Networks

Improving classification and clustering techniques using GPUs

Accelerated organ region segmentation by the revised radial basis function network using a graphics processing unit.

Layer-based visualization and biomedical information exploration of multi-channel large histological data

Live Ultrasound Color-Encoded Speckle Imaging Platform for Real-Time Complex Flow Visualization In Vivo.