Abstract

The tree-hypercube (TH) interconnection network is relatively a new interconnection network, which is constructed from tree and hypercube topologies. TH is developed to support parallel algorithms for solving computation and communication intensive problems. In this paper, we propose a new parallel multiplication algorithm on TH network to present broadcast communication operation for TH using store-and-forward technique, namely, one-to-all broadcast operation which allows a message to be transmitted through the shortest path from the source node to all other nodes. The proposed algorithm is implemented and evaluated in terms of running time, efficiency and speedup with different data size using IMAN1. The experimental results show that the runtime, efficiency and the speedup of the proposed algorithm decrease as a number of processors increases for all cases of matrices size of 1000?1000, 2000?2000, and 4000?4000.

Highlights

  • Parallel matrix multiplication is considered as a backbone for several scientific applications

  • Many proposals for matrix multiplication algorithms were done on different networks type whether it was homogeneous or heterogeneous in order to reduce the time drastically to improve the system performance

  • In matrix multiplication which is applied in tree hypercube, the coordinator creates the partitions depending on the number of processors and the matrix size, once the data is received by any processor it takes the part that it is responsible for and resends it again to the nodes that has a direct link with it

Read more

Summary

INTRODUCTION

Parallel matrix multiplication is considered as a backbone for several scientific applications. In this paper we apply matrix multiplication on tree-hypercube which was used before in adaptive fault tolerate in routing algorithm [8], [11], [12], [17]. This paper aimed to design and analyze efficient matrix multiplication algorithm on tree-hypercube network. We propose a new Parallel Matrix Multiplication Algorithm on Tree-Hypercube Network Using IMAN1 Supercomputer. CN stops the process of distribution and announces the beginning of the Stage (b)

PROPOSED ALGORITHM
Partition Procedure Analysis
EVOLUTION RESULT
Run Time Evaluation
Speed up Evaluation
Parallel Efficiency Evaluation
Findings
CONCLUSIONS AND FUTURE WORKS
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call