Network-on-Chip Design Research Articles

Innovative processor architecture designs are shifting towards Many-Core Architectures (MCAs) to meet the future demands of high-performance computing as the limits of Moore’s Law have almost been reached. Many-core processors use shared memory hierarchies to build high-speed memory systems and improve memory access efficiency. However, as the number of cores multiplies, the scalability of such a system is significantly constrained by the growing proportion of long-distance and Non-Uniform Memory Access (NUMA) traffic. Improving the scalability of MCAs is crucial for achieving large/super-scale general-purpose many-core processors. This work proposes a high-scalability memory Network-on-Chip (NoC) for Triplet-Based Many-Core Architecture (TriBA), named TriBA-mNoC. TriBA-mNoC maintains a consistent core-to-core spacing as the network scale increases, which prevents long-distance memory access latency from growing. Moreover, it leverages an inherent advantage of shared-inside hierarchical-groupings, alleviating common NUMA issues in the NoC design. Evaluations of static network characteristics show that TriBA-mNoC outperforms most classical NoCs in network diameter, average distance, and cost. TriBA-mNoC can be integrated with TriBA on the same silicon die with a tile-like floorplan, forming a novel NoC called TriBA-NoC, which combines the strengths of both networks to maximize architecture performance. We evaluated the memory access performance and scalability of TriBA-NoC using mathematical evaluation models and simulations with real traffic (PARSEC 3.0 and SPLASH-2) at different network scales. The mathematical evaluation results indicate that TriBA-NoC achieves an aggregate speedup of approximately 3x compared with 2D-Mesh for a similar number of cores. Furthermore, TriBA-NoC’s single-core speedup efficiency remains stable as the number of cores increases under the same cache hit ratio, while that of 2D-Mesh declines rapidly, highlighting TriBA-NoC’s scalability. Finally, the real-traffic simulation results show that TriBA-NoC reduces average memory access latency and time by 25.90% to 40.50% and 5.61% to 31.69%, respectively, compared with 2D-Mesh.
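The static network characteristics named in this abstract (network diameter and average distance) can be illustrated for the 2D-Mesh baseline. The following is a minimal sketch, not the paper's evaluation model: the function names (mesh_2d, hop_counts, static_metrics) are hypothetical, distance is counted in router hops, and the TriBA topology itself is not modelled because the abstract does not define it.

```python
from collections import deque
from itertools import product


def mesh_2d(rows, cols):
    """Build the adjacency list of a rows x cols 2D mesh (no wrap-around links)."""
    adj = {}
    for r, c in product(range(rows), range(cols)):
        neighbors = []
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < rows and 0 <= nc < cols:
                neighbors.append((nr, nc))
        adj[(r, c)] = neighbors
    return adj


def hop_counts(adj, source):
    """Breadth-first search: minimum hop count from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for nxt in adj[node]:
            if nxt not in dist:
                dist[nxt] = dist[node] + 1
                queue.append(nxt)
    return dist


def static_metrics(adj):
    """Return (diameter, average distance) over all ordered pairs of distinct nodes."""
    diameter, total, pairs = 0, 0, 0
    for src in adj:
        for dst, d in hop_counts(adj, src).items():
            if dst != src:
                diameter = max(diameter, d)
                total += d
                pairs += 1
    return diameter, total / pairs


if __name__ == "__main__":
    for k in (2, 4, 8):
        diameter, average = static_metrics(mesh_2d(k, k))
        print(f"{k}x{k} mesh: diameter={diameter}, average distance={average:.3f}")
```

Because static_metrics accepts any adjacency list, the same function could be applied to another topology once its links are known, which is how a diameter or average-distance comparison of this kind would typically be set up.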

Neuromorphic systems are typically designed as a tile-based architecture in which inter-tile data communication is carried over a shared global interconnect. Congestion on this interconnect can increase both interconnect energy, which raises the total energy consumption of the hardware, and latency, which degrades the performance (e.g., accuracy) of the application being executed on the hardware. The mesh-based Network-on-Chip (NoC) used in most hardware prototypes is not the optimal interconnect solution for neuromorphic systems, for two reasons. First, the power consumption and average latency of a NoC increase exponentially with the number of tiles in the hardware. Second, a NoC cannot exploit an application’s data communication pattern efficiently. Once designed for a target hardware, the bandwidth on each NoC link stays the same, independent of the volume of data traffic between different tile pairs of the NoC. In other words, a NoC cannot be customized at a finer granularity based on an individual application running on the hardware. We show that these NoC limitations block opportunities to further improve the energy and latency of neuromorphic hardware. To address these limitations, we propose a Dynamic Segmented Bus (SB) interconnect for neuromorphic systems. Here, a bus lane is partitioned into segments, with each segment connecting a few tiles. Connections between tiles and segments, and between segments, are bridged by our novel three-way segmentation switches, which are programmed in software before an application is admitted to the hardware. We partition an application by analyzing its workload and place partitions intelligently onto segments. This exploits application characteristics to use the segments without any routing collisions while exploiting the latency and energy savings in the design-time mapping phase. At a high level, our mapping algorithm places the tiles that communicate the most on shorter segments that use fewer switches, thereby reducing network congestion. It can adjust the bandwidth by controlling the number of segments connected to a destination tile. At run time, our controller dynamically executes the predefined routing paths without requiring any additional routing decisions, unlike a NoC. This allows us to improve both energy and latency. Using parallel segmented buses, our proposed interconnect architecture can support a large number of tiles without significantly increasing the design cost, energy, and latency. Simulation results show that, compared to the most widely used mesh-based NoC design, our interconnect architecture, which we call NeuSB, reduces the switch area by 20x, average interconnect energy by 6.2x, and latency by 23%.
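The idea of placing heavily communicating tiles on the same segment can be sketched with a simple greedy clustering. This is an illustrative assumption, not the NeuSB mapping algorithm: the function names, the traffic dictionary format, and the fixed segment_size capacity are all hypothetical, and the volume of traffic that crosses segments is used only as a rough proxy for switch traversals.

```python
def greedy_segment_mapping(traffic, segment_size):
    """Group tiles into segments, co-locating the heaviest-communicating pairs first.

    traffic: dict mapping (tile_a, tile_b) -> communication volume (hypothetical format).
    segment_size: assumed maximum number of tiles per segment.
    """
    # Start with every tile alone in its own segment.
    segment_of = {}
    for a, b in traffic:
        segment_of.setdefault(a, {a})
        segment_of.setdefault(b, {b})

    # Visit tile pairs from heaviest to lightest traffic and merge their
    # segments whenever the merged segment still fits the capacity.
    for (a, b), _volume in sorted(traffic.items(), key=lambda kv: kv[1], reverse=True):
        seg_a, seg_b = segment_of[a], segment_of[b]
        if seg_a is not seg_b and len(seg_a) + len(seg_b) <= segment_size:
            merged = seg_a | seg_b
            for tile in merged:
                segment_of[tile] = merged

    # Several tiles share one set object; keep each segment once.
    unique = {id(seg): seg for seg in segment_of.values()}
    return [sorted(seg) for seg in unique.values()]


def cross_segment_traffic(traffic, segments):
    """Total volume between tiles on different segments (proxy for switch use)."""
    index = {tile: i for i, seg in enumerate(segments) for tile in seg}
    return sum(v for (a, b), v in traffic.items() if index[a] != index[b])


if __name__ == "__main__":
    demo_traffic = {("A", "B"): 10, ("C", "D"): 8, ("B", "C"): 1}
    segments = greedy_segment_mapping(demo_traffic, segment_size=2)
    print(segments, cross_segment_traffic(demo_traffic, segments))
```

In the demo, only the light B-to-C flow has to cross between segments, so most of the traffic stays local to a segment. A real mapper would also have to account for segment lengths, switch placement, and collision-free path scheduling, which this sketch leaves out.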

Articles published on Network-on-Chip Design

A High Scalability Memory NoC with Shared-Inside Hierarchical-Groupings for Triplet-Based Many-Core Architecture

Network on Chip and Its Low Power Techniques

Design of fault tolerant algorithm for network on chip router using field programmable gate array

Optimization of layout for embedding half hypercube into conventional tree architectures

Design analysis of moth-flame optimized fault tolerant technique for minimally buffered network-on-chip router

DEMAP: differential evolution mapping for network on chip optimization

Secure Routing Framework for Mitigating Time-Delay Trojan Attack in System-on-Chip

Machine Learning Enabled Solutions for Design and Optimization Challenges in Networks-on-Chip based Multi/Many-Core Architectures

Spontaneous emission noise resilience of coupled nanolasers

NeuSB: A Scalable Interconnect Architecture for Spiking Neuromorphic Hardware

A Machine Learning Mapping Algorithm for NoC Optimization

A Reliability System Evaluation Model of NoC Communication with Crosstalk Analysis from Backend to Frontend

Artificial synapse topologies using arbitrary-order memristors

Enabling circuit-switching in modern on-chip networks

Anticipative QoS Control: A Self-Reconfigurable On-Chip Communication.

Statistical traffic pattern for mixed torus topology and pathfinder based traffic and thermal aware routing protocol on NoC

Software/Hardware Co-design of 3D NoC-based GPU Architectures for Accelerated Graph Computations

Flexible and Efficient QoS Provisioning in AXI4-Based Network-on-Chip Architecture

Pre-Silicon NBTI Delay-Aware Modeling of Network-on-Chip Router Microarchitecture

Implementation of Dynamic and Efficient Virtual Channel Router for Network on Chip with Virtual Channel Arbitration Reduction and Parallel Switch Allocation Unit
