GPU Modules Research Articles

Traditional visual-inertial simultaneous localization and mapping (SLAM) algorithms are usually designed for CPUs, and they cannot effectively use the parallel computing capability of a GPU when ported directly to an embedded board with a GPU module. Because the computing power of embedded devices is limited, it is unreasonable for a visual-inertial SLAM algorithm to occupy most of the CPU's computing resources while the GPU sits idle. This article proposes a parallelization scheme for the VINS-Mono algorithm based on GPU parallel computing. Using the Compute Unified Device Architecture (CUDA), the construction and solution of the incremental equation in the algorithm's nonlinear optimization are parallelized, and the parallel routines provided by cuSOLVER and cuBLAS are used to carry out marginalization. In addition, the feature point detection and matching code used in optical flow tracking is rewritten so that optical flow tracking also runs in parallel. After parallelization, the algorithm runs well on a heterogeneous computing model composed of a CPU and a GPU and can fully exploit the GPU's parallel computing power. The proposed method was tested on an NVIDIA Jetson TX2 module and compared with the original VINS-Mono algorithm: the speeds of constructing and solving the incremental equation were the same, while optical flow tracking and marginalization were about 1.5–1.7 times and 1.9 times faster, respectively.
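The abstract does not spell out the incremental equation or the marginalization step. As a minimal sketch, assuming the usual sliding-window formulation in which the incremental equation is a linear system H dx = b and marginalization is a Schur complement over the states being dropped, the algebra can be illustrated in plain Python. The function names (`solve`, `matmul`, `marginalize`) and the 3x3 example are hypothetical and are not taken from VINS-Mono or the article; the code runs on the CPU only and is meant to show which dense solves and products a GPU library would take over.

```python
# Illustrative sketch only: Schur-complement marginalization on a tiny dense
# system. Assumes the incremental equation has the form H dx = b; all names
# are hypothetical and not part of VINS-Mono.

def solve(A, B):
    """Solve A X = B by Gauss-Jordan elimination with partial pivoting.
    A is n x n and B is n x k (lists of lists); returns X as n x k."""
    n = len(A)
    M = [list(map(float, A[i])) + list(map(float, B[i])) for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular to working precision")
        M[col], M[pivot] = M[pivot], M[col]
        p = M[col][col]
        M[col] = [v / p for v in M[col]]
        for r in range(n):
            if r != col:
                f = M[r][col]
                M[r] = [a - f * c for a, c in zip(M[r], M[col])]
    return [row[n:] for row in M]

def matmul(A, B):
    """Dense matrix product of two lists of lists."""
    return [[sum(a * c for a, c in zip(row, col)) for col in zip(*B)]
            for row in A]

def marginalize(H, b, m):
    """Eliminate the first m states from H dx = b.
    Returns (H_prior, b_prior) acting on the remaining n - m states."""
    n = len(H)
    Hmm = [row[:m] for row in H[:m]]   # block of the marginalized states
    Hmr = [row[m:] for row in H[:m]]   # coupling, marginalized -> remaining
    Hrm = [row[:m] for row in H[m:]]   # coupling, remaining -> marginalized
    Hrr = [row[m:] for row in H[m:]]   # block of the remaining states
    # One solve yields both Hmm^-1 Hmr and Hmm^-1 b_m.
    X = solve(Hmm, [Hmr[i] + [b[i]] for i in range(m)])
    X_H = [row[:-1] for row in X]
    X_b = [[row[-1]] for row in X]
    corr_H = matmul(Hrm, X_H)
    corr_b = matmul(Hrm, X_b)
    k = n - m
    H_prior = [[Hrr[i][j] - corr_H[i][j] for j in range(k)] for i in range(k)]
    b_prior = [b[m + i] - corr_b[i][0] for i in range(k)]
    return H_prior, b_prior

# Usage: drop the first state of a 3-state system and solve for the rest.
H = [[4.0, 1.0, 0.0], [1.0, 3.0, 1.0], [0.0, 1.0, 2.0]]
b = [1.0, 2.0, 3.0]
H_prior, b_prior = marginalize(H, b, 1)
dx_remaining = solve(H_prior, [[v] for v in b_prior])
```

Solving the reduced system gives the same values for the remaining states as solving the full system, which is why the marginalized block can be kept as a prior. In a GPU port along the lines the abstract describes, the dense solve and matrix products are the parts that library routines would handle.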

TSV interconnect-based 3D/2.5D packaging has gained significant attention since its introduction in FPGAs (for die partitioning) and HBM-integrated GPU modules (for gaming applications). The performance potential of this technology is unequalled by any other packaging platform today. High-end applications such as deep learning, datacenter networking, AR/VR, and autonomous driving are becoming real and are pushing the limits of other current packaging platforms. Fueled by increasing bandwidth needs for moving data in cloud-computing and supercomputing applications, performance-driven markets have adopted 3D stacked technologies one after another. Imaging, the first market to adopt 3D integration, is propelling growth with an increasing number of sensors in smartphones and tablets, including 3D imaging. TSV-based products can be classified into three ranges: low, middle, and high-end. Middle and high-end products such as CMOS image sensors, memory cubes, and interposers are based on a via-middle process. Low-end products also include TSVs based on via-middle (e.g. Apple's fingerprint sensor), but for cost reasons the MEMS industry mainly uses a via-last process, which is cheaper than via-middle. TSV's penetration rate in low-end products will remain stable, with the main source of growth being RF filters in smartphone front-end modules, whose number keeps increasing to support the different frequency bands used in the 5G mobile communications protocol. This presentation will discuss the market and technology trends of TSV-based 3D/2.5D packaging.

Articles published on GPU Modules

RouteReplies: Alleviating Long Latency in Many-Chip-Module GPUs

Parallel VINS-Mono algorithm based on GPUs in embedded devices

Evaluating softcore GPU in SRAM-based FPGA under radiation-induced effects

Artificial intelligence in dynamic gesture recognition tasks

Deep Convolutional Network for Stereo Depth Mapping in Binocular Endoscopy

What is driving the TSV business: Market & Technology Trends

METHODS FOR PROTECTED CONDUCTIVE-COOLING HARDWARE-COMPUTING PLATFORMS DEVELOPMENT FOR VIDEO IMAGE PROCESSING TASKS

MCM-GPU

Experimenting with the GPU delegation principle for multi-agent simulation. Reynolds' boids as a case study
