GPGPU Research Articles

Introduction. At present, in science and technology, new computational problems constantly arise with large volumes of data, the solution of which requires the use of powerful supercomputers. Most of these problems come down to solving systems of linear algebraic equations (SLAE). The main problem of solving problems on a computer is to obtain reliable solutions with minimal computing resources. However, the problem that is solved on a computer always contains approximate data regarding the original task (due to errors in the initial data, errors when entering numerical data into the computer, etc.). Thus, the mathematical properties of a computer problem can differ significantly from the properties of the original problem. It is necessary to solve problems taking into account approximate data and analyze computer results. Despite the significant results of research in the field of linear algebra, work in the direction of overcoming the existing problems of computer solving problems with approximate data is further aggravated by the use of contemporary supercomputers, do not lose their significance and require further development. Today, the most high-performance supercomputers are parallel ones with graphic processors. The architectural and technological features of these computers make it possible to significantly increase the efficiency of solving problems of large volumes at relatively low energy costs. The purpose of the article is to develop new parallel algorithms for solving systems of linear algebraic equations with approximate data on supercomputers with graphic processors that implement the automatic adjustment of the algorithms to the effective computer architecture and the mathematical properties of the problem, identified in the computer, as well with estimates of the reliability of the results. Results. A methodology for creating parallel algorithms for supercomputers with graphic processors that implement the study of the mathematical properties of linear systems with approximate data and the algorithms with the analysis of the reliability of the results are described. The results of computational experiments on the SKIT-4 supercomputer are presented. Conclusions. Parallel algorithms have been created for investigating and solving linear systems with approximate data on supercomputers with graphic processors. Numerical experiments with the new algorithms showed a significant acceleration of calculations with a guarantee of the reliability of the results. Keywords: systems of linear algebraic equations, hybrid algorithm, approximate data, reliability of the results, GPU computers.

Software-deﬁned Radio is a programmable transceiver with the capability of operating various wireless communication protocols without the need to change or update the hardware. Consequently, Software-deﬁned Radio has earned a lot of attention and is of great signiﬁcance to both academia, military and aerospace industry. Components of Software-deﬁned Radio (e.g. mixers, filters, amplifiers, modulators/demodulators, detectors, etc.) implemented by means of software on a personal computer or embedded system. Operation of signal processing are handed over to the baseband processor, rather than being done in special electronic circuits. Baseband processors are implemented through employing various types of hardware platforms, such as General Purpose Processors, Graphics Processing Units, Digital Signal Processors, and Field Programmable Gate Arrays. Each of these platforms is associated with their own set of advantages and disadvantages. In this paper was proposed a comparison of the state-of-the-art hardware platforms in the context of implementation Software-deﬁned Radio transceivers. For comparison was determined as follow criteria: computational power of hardware platform, power consumption, complexity of developing, and cost of tools and equipment. First approaches to realizing baseband processors is using a General Purpose Processor and accelerating by Graphics Processing Units. But General Purpose Processor and Graphics Processing Units execute software instructions in the sequential order. For this reason, General Purpose Processors are not convenient for high-throughput computing with real-time requirements. Also this hardware platforms have increased power consumption. This aspect does not allow use General Purpose Processor and Graphics Processing Units in small and portable Software-deﬁned Radio transceivers. In other hand, General Purpose Processors are preferable hardware platform by researchers and beginners due to their ﬂexibility and programmability. Therefore, General Purpose Processors and Graphics Processing Units is highly recommended for prototyping Software-deﬁned Radio platforms. Digital Signal Processor was reviewed as alternative approach for implementing baseband processors. Digital Signal Processors is a particular type of General Purpose Processors that is optimized to process digital signals. Digital Signal Processors have similar disadvantage with insufficient computational power, but some manufacturer sell energy optimized Digital Signal Processors. Consequently, Digital Signal Processor is commonly used in small and portable Software-deﬁned Radio transceivers. Field Programmable Gate Arrays and System-on-Chips with Field Programmable Gate Array are strongly recommended for high-performance Software-deﬁned Radio platforms. This hardware platforms combine the ﬂexibility of processors and efﬁciency of small Digital Signal Processor. Field Programmable Gate Arrays can achieve a high level of parallelism in executing digital signal processing. However, the designers must have a high degree in digital electronics and good acknowledgement of hardware description languages. After the research, was proposed own flexible architecture Software-deﬁned Radio transceiver and methods for development.

GPGPU Research Articles

Related Topics

Articles published on GPGPU

Approximate Cache in GPGPUs

Real time object detection and trackingsystem for video surveillance system

Cooperation of CUDA and Intel multi-core architecture in the independent component analysis algorithm for EEG data

Optimizing non-coalesced memory access for irregular applications with GPU computing

Comprehensive high-speed reconciliation for continuous-variable quantum key distribution

Acceleration and enhancement of reliability of simulated annealing for optimizing thinning schedule of a forest stand

Passively parallel regularized stokeslets.

An Adaptive Clock Scheme Exploiting Instruction-Based Dynamic Timing Slack for a GPGPU Architecture

Special issue on SoC and AI processors

AB9: A neural processor for inference acceleration

One size does not fit all: accelerating OLAP workloads with GPUs

Parallel Algorithms for Solving Linear Systems on Hybrid Computers

Soil Monitor: an internet platform to challenge soil sealing in Italy

PowerCoord: Power capping coordination for multi-CPU/GPU servers using reinforcement learning

Comparison of Baseband Processors in Terms of Realization SDR-Transceivers

Computation of minimal unsatisfiable subformulas for SAT-based digital circuit error diagnosis

Exceeding Conservative Limits: A Consolidated Analysis on Modern Hardware Margins

Applying the swept rule for solving explicit partial differential equations on heterogeneous computing systems

Simulation of Fire with a Gas Kinetic Scheme on Distributed GPGPU Architectures

Trivial Bypassing in GPGPUs

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

GPGPU Research Articles

Related Topics

Articles published on GPGPU

Approximate Cache in GPGPUs

Real time object detection and trackingsystem for video surveillance system

Cooperation of CUDA and Intel multi-core architecture in the independent component analysis algorithm for EEG data

Optimizing non-coalesced memory access for irregular applications with GPU computing

Comprehensive high-speed reconciliation for continuous-variable quantum key distribution

Acceleration and enhancement of reliability of simulated annealing for optimizing thinning schedule of a forest stand

Passively parallel regularized stokeslets.

An Adaptive Clock Scheme Exploiting Instruction-Based Dynamic Timing Slack for a GPGPU Architecture

Special issue on SoC and AI processors

AB9: A neural processor for inference acceleration

One size does not fit all: accelerating OLAP workloads with GPUs

Parallel Algorithms for Solving Linear Systems on Hybrid Computers

Soil Monitor: an internet platform to challenge soil sealing in Italy

PowerCoord: Power capping coordination for multi-CPU/GPU servers using reinforcement learning

Comparison of Baseband Processors in Terms of Realization SDR-Transceivers

Computation of minimal unsatisfiable subformulas for SAT-based digital circuit error diagnosis

Exceeding Conservative Limits: A Consolidated Analysis on Modern Hardware Margins

Applying the swept rule for solving explicit partial differential equations on heterogeneous computing systems

Simulation of Fire with a Gas Kinetic Scheme on Distributed GPGPU Architectures

Trivial Bypassing in GPGPUs