Distributed-memory Parallel Applications Research Articles

Minimum-cost flow problems arise widely in graph theory, computer science, information science, and transportation science. The network simplex algorithm is a fast and frequently used method for solving them. However, conventional sequential algorithms cannot meet the computational efficiency required for large-scale networks. Parallel computing has driven numerous significant advances in science and technology over the past decades and has the potential to relieve the computational bottleneck of large-scale networks. This paper first analyzes the parallelizability of the network simplex algorithm and then presents a multi-granularity parallel network simplex algorithm (MPNSA) with fine- and coarse-granularity parallel strategies, which are suitable for shared- and distributed-memory parallel applications, respectively. MPNSA is implemented with the Message Passing Interface (MPI), Open Multi-Processing (OpenMP), and Compute Unified Device Architecture (CUDA), so that it is compatible with different high-performance computing platforms. Experimental results demonstrate that MPNSA achieves substantial acceleration, with a maximum speedup of 18.7.

We propose a hypergraph model for mapping applications with an all-neighbor communication pattern to distributed-memory computers. The model originated in finite element triangulations. Rather than approximating the communication volume for linear algebra operations, this new model represents the communication volume exactly. To this end, a hypergraph partitioning problem is formulated where the objective function involves a new metric. This metric, the λ(λ-1)-metric, accurately models the communication volume for an all-neighbor communication pattern occurring in a concrete finite element application. It is a member of a more general class of metrics, which also contains more widely used metrics, such as the cut-net and the (λ-1)-metric. In addition, we develop a heuristic to minimize the communication volume in the new λ(λ-1)-metric. For several real-world finite element problems, experimental results based on this new heuristic show a small reduction in communication volume compared to a standard graph partitioner and no significant reduction compared to a hypergraph partitioner using the common (λ-1)-metric. However, for this set of problems, the new approach does reduce actual communication times. As a by-product, we observe that it also tends to reduce the number of messages. Furthermore, the new approach dramatically reduces the communication volume for a set of sparse matrix problems that are more irregularly structured than finite element problems.
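To make the three metrics named above concrete, the following minimal Python sketch evaluates them for a fixed partition of a small hypergraph. It assumes the usual definitions: λ is the number of distinct parts that a net's vertices touch, the cut-net metric counts nets with λ > 1, and the other two metrics sum λ-1 and λ(λ-1) over all nets. The function names, the unit net costs, and the toy hypergraph are illustrative assumptions, not taken from the paper, and the paper's partitioning heuristic is not reproduced here.

```python
from typing import Dict, List, Optional, Sequence


def connectivity(net: Sequence[int], part_of: Dict[int, int]) -> int:
    """Return lambda: the number of distinct parts touched by one net."""
    return len({part_of[v] for v in net})


def partition_metrics(
    nets: List[Sequence[int]],
    part_of: Dict[int, int],
    costs: Optional[List[int]] = None,
) -> Dict[str, int]:
    """Evaluate cut-net, (lambda-1), and lambda(lambda-1) for a partition.

    Unit net costs are assumed when `costs` is not given (an assumption
    made for this sketch only).
    """
    if costs is None:
        costs = [1] * len(nets)
    cut_net = lam_minus_1 = lam_lam_minus_1 = 0
    for net, cost in zip(nets, costs):
        lam = connectivity(net, part_of)
        if lam > 1:
            cut_net += cost                      # net is cut at all
        lam_minus_1 += cost * (lam - 1)          # one owner sends to the others
        lam_lam_minus_1 += cost * lam * (lam - 1)  # every part sends to every other
    return {
        "cut_net": cut_net,
        "lambda_minus_1": lam_minus_1,
        "lambda_lambda_minus_1": lam_lam_minus_1,
    }


# Hypothetical toy hypergraph: six vertices, five nets, three parts.
toy_nets = [[0, 1, 2], [2, 3], [3, 4, 5], [0, 5], [1, 3, 4]]
toy_part_of = {0: 0, 1: 0, 2: 1, 3: 1, 4: 2, 5: 2}
toy_metrics = partition_metrics(toy_nets, toy_part_of)
print(toy_metrics)
```

One reading of the difference: under the (λ-1)-metric a net spanning three parts contributes 2, as if a single owner sent its data to the two other parts, whereas under the λ(λ-1)-metric it contributes 6, as if each of the three parts sent to both of the others, which matches an all-neighbor exchange.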

Articles published on Distributed-memory Parallel Applications

Multi-granularity hybrid parallel network simplex algorithm for minimum-cost flow problems

Energy, Memory, and Runtime Tradeoffs for Implementing Collective Communication Operations

A new metric enabling an exact hypergraph model for the communication volume in distributed-memory parallel applications

Design and performance of a scheduling framework for resizable parallel applications
