Network-on-Chip Design Research Articles

Recent advances in GPU-based manycore accelerators provide the opportunity to efficiently process large-scale graphs on chip. However, real-world graphs have a diverse range of topology and connectivity patterns (e.g., degree distributions) that make the design of input-agnostic hardware architectures a challenge. Network-on-Chip (NoC)-based architectures provide a way to overcome this challenge, as the architectural topology can be used to approximately model the expected traffic patterns that emerge from graph application workloads. In this paper, we first study the mix of long- and short-range traffic patterns generated on-chip by graph workloads, and subsequently use the findings to adapt the design of an optimal NoC-based architecture. In particular, by leveraging emerging three-dimensional (3D) integration technology, we propose the design of a small-world NoC (SWNoC)-enabled manycore GPU architecture, where the placement of the links connecting the streaming multiprocessors (SMs) and the memory controllers (MCs) follows a power-law distribution. The proposed 3D manycore GPU architecture outperforms its traditional planar (2D) counterparts in both performance and energy consumption. Moreover, by adopting a joint performance-thermal optimization strategy, we address the thermal concerns in a 3D design without noticeably compromising the achievable performance. The 3D integration technology is also leveraged to incorporate Near Data Processing (NDP) to complement the performance benefits introduced by the SWNoC architecture. As graph applications are inherently memory intensive, off-chip data movement gives rise to latency and energy overheads in the presence of external DRAM. In conventional GPU architectures, the main memory layer is not integrated with the logic, so off-chip data movement negatively impacts overall performance and energy consumption. We demonstrate that NDP significantly reduces the overheads associated with such frequent and irregular memory accesses in graph-based applications. The proposed SWNoC-enabled NDP framework, which integrates 3D memory (like Micron's HMC) with a massive number of GPU cores, achieves a 29.5% performance improvement and 30.03% less energy consumption on average compared to a conventional planar Mesh-based design with external DRAM.
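As a rough illustration of what "link placement following a power-law distribution" can mean, the sketch below assigns every node pair on a small 3D grid a link probability proportional to distance raised to a negative exponent. The function name, the grid size, the use of Euclidean distance, and the exponent `alpha=1.8` are all assumptions made for this example; the abstract does not state the actual formula or parameters used by the authors.

```python
import itertools
import math


def power_law_link_probabilities(dims, alpha=1.8):
    """Return {(a, b): p} for every unordered node pair on a 3D grid.

    p is proportional to distance ** (-alpha) and normalized so that all
    probabilities sum to 1. `alpha` is a hypothetical value chosen for
    illustration only.
    """
    # Enumerate grid coordinates, e.g. (x, y, layer).
    nodes = list(itertools.product(*(range(n) for n in dims)))
    weights = {}
    for a, b in itertools.combinations(nodes, 2):
        dist = math.dist(a, b)  # Euclidean distance in grid units
        weights[(a, b)] = dist ** (-alpha)
    total = sum(weights.values())
    return {pair: w / total for pair, w in weights.items()}


# Example: a 4 x 4 grid stacked in 2 layers (32 nodes).
probs = power_law_link_probabilities((4, 4, 2))
short_link = probs[((0, 0, 0), (0, 0, 1))]  # vertical neighbor
long_link = probs[((0, 0, 0), (3, 3, 1))]   # far corner
```

Under this weighting, short links are far more likely than long ones, but a few long-range shortcuts still receive non-zero probability, which is the property usually associated with small-world networks.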

In the nano-scale era, the Network-on-Chip (NoC) interconnection paradigm has gained importance as a way to address the communication challenges in Chip Multi-Processors (CMPs). With increased integration density on CMPs, NoC components, namely cores, routers, and links, are susceptible to failures. Therefore, to improve system reliability, there is a need for efficient fault-tolerant techniques that mitigate permanent faults in NoC-based CMPs. Several fault-tolerant techniques exist that address permanent faults in application cores by placing spare cores onto NoC topologies. However, these techniques are limited to Mesh topology-based NoCs. A few approaches have realized fault-tolerant solutions on an FPGA, but the study of the architectural aspects of the NoC is limited. This paper presents the flexible placement of a spare core onto a Torus topology-based NoC design by considering core faults, and validates it on an FPGA. In the first phase, a mathematical formulation based on Integer Linear Programming (ILP) and a meta-heuristic based on Particle Swarm Optimization (PSO) are proposed for the placement of the spare core. In the second phase, we implement the NoC router addressing scheme, routing algorithm, run-time fault injection model, and fault-tolerant placement of the spare core onto the Torus topology using an FPGA. Experiments use different multimedia and synthetic application benchmarks, in both static and dynamic simulation environments, followed by hardware implementation. In the static simulation environment, the experiments are carried out by scaling the network size and the router faults in the network. The results obtained from our approach outperform methods such as Fault-tolerant Spare Core Mapping (FSCM), Simulated Annealing (SA), and Genetic Algorithm (GA) proposed in the literature. For the experiments carried out by scaling the network size, our proposed methodology shows an average improvement of 18.83%, 4.55%, and 12.12% in communication cost over FSCM, SA, and GA, respectively. For the experiments carried out by scaling the router faults in the network, our approach shows an improvement of 34.27%, 26.26%, and 30.41% over FSCM, SA, and GA, respectively. For the dynamic simulations, our approach shows an average improvement of 5.67%, 0.44%, and 3.69% over FSCM, SA, and GA, respectively. In the hardware implementation, our approach shows an average improvement of 5.38%, 7.45%, and 27.10% in application runtime over SA, GA, and FSCM, respectively. This shows the superiority of the proposed approach over the approaches presented in the literature.
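To make the notions of Torus hop distance and communication cost concrete, here is a minimal sketch. It assumes a 2D torus, defines communication cost as bandwidth multiplied by hop count summed over all communicating task pairs (a common definition in the mapping literature, but not confirmed by the abstract), and picks a spare tile by exhaustive search. The exhaustive search is a stand-in for illustration; it is not the ILP or PSO formulation the paper describes. All function names, task names, and traffic values are hypothetical.

```python
def torus_hops(a, b, rows, cols):
    """Minimal hop count between tiles a and b on a rows x cols torus."""
    dr = abs(a[0] - b[0])
    dc = abs(a[1] - b[1])
    # Wrap-around links let a packet take the shorter way around each ring.
    return min(dr, rows - dr) + min(dc, cols - dc)


def communication_cost(placement, traffic, rows, cols):
    """Sum of bandwidth * hops; placement maps task -> (row, col)."""
    return sum(
        bw * torus_hops(placement[src], placement[dst], rows, cols)
        for (src, dst), bw in traffic.items()
    )


def best_spare_position(placement, traffic, rows, cols):
    """Pick the free tile with the lowest average cost over single-core failures.

    Assumption: when a core fails, its task migrates to the spare tile.
    """
    used = set(placement.values())
    free = [(r, c) for r in range(rows) for c in range(cols) if (r, c) not in used]

    def avg_cost_after_failure(spare):
        total = 0
        for failed in placement:
            moved = dict(placement)
            moved[failed] = spare  # failed task now runs on the spare
            total += communication_cost(moved, traffic, rows, cols)
        return total / len(placement)

    return min(free, key=avg_cost_after_failure)


# Hypothetical 4 x 4 torus with three tasks and two traffic flows.
placement = {"t0": (0, 0), "t1": (0, 3), "t2": (2, 2)}
traffic = {("t0", "t1"): 10, ("t1", "t2"): 5}
cost = communication_cost(placement, traffic, 4, 4)
spare = best_spare_position(placement, traffic, 4, 4)
```

Note that tiles (0, 0) and (0, 3) are only one hop apart here because of the wrap-around link, which is the main difference from a Mesh of the same size.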

Articles published on Network-on-Chip Design

Smart Communication Using 2D and 3D Mesh Network-on-Chip

Design and Simulation of Ring Network-on-Chip for Different Configured Nodes

High-Performance and Energy-Efficient 3D Manycore GPU Architecture for Accelerating Graph Analytics

Energy Efficient NoC design through Supervised Machine Learning

A Multi-Phase Based Multi-Application Mapping Approach for Many-Core Networks-on-Chip

Energy-efficient task-resource co-allocation and heterogeneous multi-core NoC design in dark silicon era

Fault-Tolerant Application Mapping on Mesh-of-Tree based Network-on-Chip

Application Mapping Using Cuckoo Search Optimization With Lévy Flight for NoC-Based System

Flexible Spare Core Placement in Torus Topology Based NoCs and Its Validation on an FPGA

Machine Learning Approaches for Efficient Design Space Exploration of Application-Specific NoCs

Task mapping and flow priority assignment of real-time industrial applications for network-on-chip based design

Enforcing Predictability of Many-Cores With DCFNoC

Performance Evaluation of Application Mapping Approaches for Network-on-Chip Designs

Design of an Efficient Fault and Congestion Free NoC Design using Adaptive Routing on FPGA

Impact of Electrostatic Coupling on Monolithic 3D-enabled Network on Chip

SSS

Butterfly-Fat-Tree topology based fault-tolerant Network-on-Chip design using particle swarm optimisation

Parallel overloaded CDMA crossbar for network on chip

MMNoC: Embedding Memory Management Units into Network-on-Chip for Lightweight Embedded Systems

Energy efficient heuristic application mapping for 2-D mesh-based network-on-chip

