Processor Design Research Articles

The concept of “all cores are created equal” has been popular for several decades due to its simplicity and effectiveness in CPU (Central Processing Unit) design. The more cores the CPU has, the higher performance the host owns and the higher the power consumption. However, power-saving is also one of the key goals for servers in data centers and embedded devices (e.g., mobile phones). The big.LITTLE multicore architecture, which contains high-performance cores (namely big core) and power-saved cores (namely little core), has been developed by ARM (Advanced RISC Machine) and Intel to trade off performance and power efficiency. Facing the new heterogeneous computing architecture, the traditional lock algorithms, which are designed to run on homogeneous computing architecture, cannot work optimally as usual and drop into the performance issue for the difference between big core and little core. In our preliminary experiment, we observed that, in the big.LITTLE multicore architecture, all these lock algorithms exhibit sub-optimal performance. The FIFO-based (First In First Out) locks experience throughput degradation, while the performance of competition-based locks can be divided into two categories. One of them is big-core-friendly, so their tail latency increases significantly; the other is little-core-friendly. Not only does the tail latency increase, but the throughput is also degraded. Motivated by this observation, we propose a Core-Aware Lock for the big.LITTLE multicore architecture named CAL, which keeps each core having an equal opportunity to access the critical section in the program. The core idea of the CAL is to take the slowdown ratio as the matric to reorder lock requests of these big and little cores. By evaluating benchmarks and a real-world application named LevelDB, CAL is confirmed to achieve fairness goals in heterogeneous computing architecture without sacrificing the performance of the big core. Compared to several traditional lock algorithms, the CAL’s fairness has increased by up to 67%; and Its throughput is 26% higher than FIFO-based locks and 53% higher than competition-based locks, respectively. In addition, the tail latency of CAL is always kept at a low level.

Read full abstract

Context. The digital signal processing is applied in many fields of science, technology and human activity. One of the ways of implementing algorithms of digital signal processing is the development of coprocessors as an integral part of well-known architectures. In the case of developing a pipelined device, the presented approach will allow to use software and hardware tools of the appropriate architecture, provide the faster execution of signal processing algorithms, reduce the number of cycles and memory accesses. Objective. Objectives are design and characterization study of a pipelined RISC-V processor and coprocessor of digital signal processing which performs fast Fourier transform. Method. Analyzing technical literature and existing decisions allow to assess advantages and disadvantages of modern developments and on the basis of which to form the relevance of the selected topic. Model designing and simulation results allow to examine a model efficiency, to determine weak components’ parts and to improve model parameters. Results. The pipelined RISC-V processor has been designed which executes a basic set of instructions. Execution time of assembly program on the single-cycled and the pipelined processors have been analyzed. According to the results, the test program on the pipelined processor is executed in 29 cycles, while on the single-cycle processor it takes 60 cycles. The structure of the coprocessor for the fast Fourier transform algorithm and a set of processor instructions that allow working with the coprocessor have been developed. The number of cycles of the coprocessor based on Radix-2 fast Fourier transform algorithm for 512 points is 2358 cycles, and for 1024 points is 5180 cycles. Conclusions. Conducted researches and calculations have showed that the application of the developed hardware coprocessor reduces the fast Fourier transform algorithm execution time and the load of the pipelined processor during calculations.

Read full abstract

Processor Design Research Articles

Related Topics

Articles published on Processor Design

Thermal design of a frequency hopping processor

Design of Low Power Control Unit for RISC-V Processor Core

Design of Variation Tolerant Near Threshold Processor Using Artificial Ecosystem Optimizer with Hybrid Deep Learning

Structural design and thermodynamic analysis of high heat dissipation on board computers based on VPX architecture

Integrating error correction and detection techniques in RISC-V processor microarchitecture for enhanced reliability

Implementation of Application Specific Instruction set Processor for Approximate Computing

Design and Implementation of a Novel Hybrid Quantum-Classical Processor for Enhanced Computation Speed

Enhanced CPU Design for SDN Controller.

CAL: Core-Aware Lock for the big.LITTLE Multicore Architecture

An energy-efficient 32-bit bit-parallel superconducting SFQ specialized processor

FPGA-Based Acceleration of Polar-Format Algorithm for Video Synthetic-Aperture Radar Imaging

Lightweight ASIP Design for Lattice-Based Post-quantum Cryptography Algorithms

Design and implementation of a configurable parallel FFT processor in onboard SAR imaging system based on FPGA

THE DESIGN OF THE PIPELINED RISC-V PROCESSOR WITH THE HARDWARE COPROCESSOR OF DIGITAL SIGNAL PROCESSING

Technological Prerequisites and Consequences of Ubiquitous Computing and Networking in Resurrecting Extinct Computers

Design and Implementation of a Real-Time Imaging Processor for Spaceborne Synthetic Aperture Radar With Configurability

Vision Transformer-based overlay processor for Edge Computing

Basic Computer Architecture and Quantitative Techniques in Computer Design Instruction Pipeline ,Arithmetic Pipeline, Hazards ,Exception and Interrupts

Multi-Voltage Design of RISC Processor for Low Power Application: A Survey

Design and Evaluation of Open-Source Soft-Core Processors

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Processor Design Research Articles

Related Topics

Articles published on Processor Design

Thermal design of a frequency hopping processor

Design of Low Power Control Unit for RISC-V Processor Core

Design of Variation Tolerant Near Threshold Processor Using Artificial Ecosystem Optimizer with Hybrid Deep Learning

Structural design and thermodynamic analysis of high heat dissipation on board computers based on VPX architecture

Integrating error correction and detection techniques in RISC-V processor microarchitecture for enhanced reliability

Implementation of Application Specific Instruction set Processor for Approximate Computing

Design and Implementation of a Novel Hybrid Quantum-Classical Processor for Enhanced Computation Speed

Enhanced CPU Design for SDN Controller.

CAL: Core-Aware Lock for the big.LITTLE Multicore Architecture

An energy-efficient 32-bit bit-parallel superconducting SFQ specialized processor

FPGA-Based Acceleration of Polar-Format Algorithm for Video Synthetic-Aperture Radar Imaging

Lightweight ASIP Design for Lattice-Based Post-quantum Cryptography Algorithms

Design and implementation of a configurable parallel FFT processor in onboard SAR imaging system based on FPGA

THE DESIGN OF THE PIPELINED RISC-V PROCESSOR WITH THE HARDWARE COPROCESSOR OF DIGITAL SIGNAL PROCESSING

Technological Prerequisites and Consequences of Ubiquitous Computing and Networking in Resurrecting Extinct Computers

Design and Implementation of a Real-Time Imaging Processor for Spaceborne Synthetic Aperture Radar With Configurability

Vision Transformer-based overlay processor for Edge Computing

Basic Computer Architecture and Quantitative Techniques in Computer Design Instruction Pipeline ,Arithmetic Pipeline, Hazards ,Exception and Interrupts

Multi-Voltage Design of RISC Processor for Low Power Application: A Survey

Design and Evaluation of Open-Source Soft-Core Processors