Switch Traversal Research Articles

Large systems-on-chip (SoCs) and chip multiprocessors (CMPs), incorporating tens to hundreds of cores, create a significant integration challenge. Interconnecting a huge amount of architectural modules in an efficient manner, calls for scalable solutions that would offer both high throughput and low-latency communication. The switches are the basic building blocks of such interconnection networks and their design critically affects the performance of the whole system. So far, innovation in switch design relied mostly to architecture-level solutions that took for granted the characteristics of the main building blocks of the switch, such as the buffers, the routing logic, the arbiters, the crossbar's multiplexers, and without any further modifications, tried to reorganize them in a more efficient way. Although such pure high-level design has produced highly efficient switches, the question of how much better the switch would be if better building blocks were available remains to be investigated. In this paper, we try to partially answer this question by explicitly targeting the design from scratch of new soft macros that can handle concurrently arbitration and multiplexing and can be parameterized with the number of inputs, the data width, and the priority selection policy. With the proposed macros, switch allocation, which employs either standard round robin or more sophisticated arbitration policies with significant network-throughput benefits, and switch traversal, can be performed simultaneously in the same cycle, while still offering energy-delay efficient implementations.

Read full abstract

Network-on-Chip (NoC) is becoming a popular solution for communication on System-on-Chips. A router is a major component of NoC which is responsible for handling the communication. Its architecture significantly impacts on the performance of NoC. In this paper, we propose a low latency router architecture based on virtual output queuing (VOQ). The number of pipeline stages of a packet transfer can be reduced to one stage, by using VOQ buffers and speculatively performing switch allocation and switch traversal in parallel. This paper also proposes a multiple VOQ architecture for which each input port maintains multiple queues for each output channel to improve the throughput of the router. We have implemented the proposed router on FPGA and evaluated in terms of communication latency, throughput and hardware amount. The experimental results show that in a 4 × 4 two-dimensional mesh network, the proposed multiple VOQ router reduces the communication latency by 25% and cost of area by 15.6% as compared to the look-ahead speculative virtual channel router.

Read full abstract

Switch Traversal Research Articles

Related Topics

Articles published on Switch Traversal

Merged Switch Allocation and Traversal in Network-on-Chip Switches

Optical high radix switch design

The Implementation of a Low Cost Single-cycle On-chip Router Based on Multiple Virtual Output Queuing

A High-throughput Router Architecture with On-the-fly Virtual Channel Allocation for On-chip Networks

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Switch Traversal Research Articles

Related Topics

Articles published on Switch Traversal

Merged Switch Allocation and Traversal in Network-on-Chip Switches

Optical high radix switch design

The Implementation of a Low Cost Single-cycle On-chip Router Based on Multiple Virtual Output Queuing

A High-throughput Router Architecture with On-the-fly Virtual Channel Allocation for On-chip Networks