Design Of Parallel Algorithms Research Articles

Today, large-scale parallel systems are available at low cost. Many powerful systems of this kind have been installed all over the world, and the number of users keeps increasing. Using them efficiently is becoming harder as more and more architectural constraints interact in complex ways and as applications grow more diverse. The design of efficient parallel algorithms has to be reconsidered in light of the new parameters of such platforms (namely cluster, grid and global computing), which are characterized by a larger number of heterogeneous processors, often organized in several hierarchical sub-systems. At each step in the evolution of the parallel processing field, researchers have designed computational models that abstract the real world so that the behavior of algorithms can be analyzed. In this paper, we investigate two complementary computational models that have been proposed recently: Parallel Task (PT) and Divisible Load (DL). The Parallel Task model, in which tasks require more than one processor for their execution, is a promising alternative for scheduling parallel applications, especially when the communication media are slow. The basic idea is to consider the application at a coarse level of granularity. Another way of looking at the problem (in some sense a dual view) is the Divisible Load model, in which an application is considered as a collection of a large number of elementary sequential computing units that are distributed among the available resources. Unlike the PT model, the DL model corresponds to a fine level of granularity. We focus on the PT model and discuss how to combine it with simple Divisible Load scheduling.
In actual systems, the main difficulty in distributing the load among the processors (usually known as the scheduling problem) comes from handling communications efficiently. These two models allow us to consider communications implicitly or to mask them, which leads to more tractable problems. We show that, in spite of the enormous complexity of the general scheduling problem on new platforms, it is still useful to study theoretical models. We focus on the links between models and actual implementations on a regional grid with more than 500 processors.
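As an illustration of the Divisible Load view described above, the following is a minimal sketch that splits a number of elementary units across heterogeneous processors in proportion to their speeds. It is not the paper's scheduling method: the function names, the `speeds` parameter (units processed per second) and the choice to ignore communication costs entirely are assumptions made for this example.

```python
from typing import List


def split_divisible_load(total_units: int, speeds: List[float]) -> List[int]:
    """Assign elementary units to processors in proportion to their speeds.

    Assumption for this sketch: communications are masked (cost zero),
    so a proportional split makes all processors finish at about the same time.
    """
    if total_units < 0 or not speeds or any(s <= 0 for s in speeds):
        raise ValueError("need a non-negative load and positive speeds")

    total_speed = sum(speeds)
    # Ideal (fractional) share of each processor.
    ideal = [total_units * s / total_speed for s in speeds]
    # Units are indivisible, so start from the rounded-down shares.
    shares = [int(x) for x in ideal]

    # Give the remaining units to the largest fractional remainders.
    leftover = total_units - sum(shares)
    order = sorted(range(len(speeds)),
                   key=lambda i: ideal[i] - shares[i],
                   reverse=True)
    for i in order[:leftover]:
        shares[i] += 1
    return shares


def makespan(shares: List[int], speeds: List[float]) -> float:
    """Completion time of the slowest processor for a given split."""
    return max(n / s for n, s in zip(shares, speeds))


if __name__ == "__main__":
    speeds = [1.0, 2.0, 2.0]          # hypothetical processor speeds
    shares = split_divisible_load(1000, speeds)
    print(shares, makespan(shares, speeds))
```

With communications ignored, the proportional split is the natural baseline; a realistic Divisible Load schedule would also account for the time needed to send each share to its processor.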

The development of intelligent transportation systems (ITS), and the resulting need to solve a variety of dynamic traffic network models and management problems, requires faster-than-real-time computation of shortest paths in dynamic networks. Recently, a sequential algorithm was developed to compute shortest paths in discrete-time dynamic networks from all nodes and all departure times to one destination node. The algorithm is known as algorithm DOT and has an optimal worst-case running-time complexity. This implies that no sequential algorithm with a better worst-case computational complexity exists. Consequently, to derive faster algorithms for all-to-one shortest path problems in dynamic networks, one needs to explore avenues other than the design of sequential solution algorithms alone. Using commercially available high-performance computing platforms to develop parallel implementations of sequential algorithms is one such avenue. This paper reports on the design, implementation, and computational testing of parallel dynamic shortest path algorithms. We develop two shared-memory and two message-passing dynamic shortest path algorithm implementations, which are derived from algorithm DOT using the following parallelization strategies: decomposition by destination and decomposition by transportation network topology. The algorithms are coded using two types of parallel computing environments: a message-passing environment based on the Parallel Virtual Machine (PVM) library and a multi-threading environment based on the SUN Microsystems Multi-Threads (MT) library. We also develop a time-based parallel version of algorithm DOT for the case of minimum time paths in FIFO networks, and a theoretical parallelization of algorithm DOT on an 'ideal' theoretical parallel machine.
The performance of the implementations is analyzed and evaluated using large transportation networks and two types of parallel computing platforms: a distributed network of Unix workstations and a SUN shared-memory machine containing eight processors. Satisfactory speed-ups over the running time of the sequential algorithms are achieved, in particular on shared-memory machines. Numerical results indicate that shared-memory computers constitute the most appropriate type of parallel computing platform for the computation of dynamic shortest paths for real-time ITS applications.
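The decomposition-by-destination strategy mentioned above can be sketched as follows. The abstract does not spell out algorithm DOT, so this example assumes a simple backward sweep over departure times for the all-to-one problem; it should be read as an illustration, not as the authors' implementation. Further assumptions: travel times are positive integers, the network is static from the horizon `T` onward, waiting at nodes is not allowed, and all names (`all_to_one`, `by_destination`, the `arcs` layout) are hypothetical. Python threads are used only to show the structure of the decomposition; the paper's implementations use PVM and the SUN MT library.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Dict, List, Tuple

INF = float("inf")

# arcs[(i, j)] holds T + 1 positive integer travel times:
# entries 0..T-1 are time-dependent, entry T is the static value used from T on.
Arcs = Dict[Tuple[int, int], List[int]]


def all_to_one(n: int, arcs: Arcs, horizon: int, dest: int) -> List[List[float]]:
    """Return label[i][t]: minimum travel time from node i, leaving at time t, to dest."""
    T = horizon
    label = [[INF] * (T + 1) for _ in range(n)]
    for t in range(T + 1):
        label[dest][t] = 0

    # Static period (t >= T): Bellman-Ford relaxation on the time-T travel times.
    for _ in range(n - 1):
        for (i, j), d in arcs.items():
            label[i][T] = min(label[i][T], d[T] + label[j][T])

    # Backward sweep: since travel times are >= 1, every label read at a
    # later arrival time is already final when time t is processed.
    for t in range(T - 1, -1, -1):
        for (i, j), d in arcs.items():
            arrival = min(T, t + d[t])
            label[i][t] = min(label[i][t], d[t] + label[j][arrival])
    return label


def by_destination(n: int, arcs: Arcs, horizon: int,
                   dests: List[int], workers: int = 4) -> Dict[int, List[List[float]]]:
    """Solve one independent all-to-one problem per destination, in parallel."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = pool.map(lambda q: all_to_one(n, arcs, horizon, q), dests)
        return dict(zip(dests, results))


if __name__ == "__main__":
    arcs = {(0, 1): [1, 1, 1], (1, 2): [1, 3, 1], (0, 2): [5, 1, 4]}
    labels = by_destination(3, arcs, horizon=2, dests=[1, 2])
    print(labels[2][0])  # travel times from node 0 to node 2, per departure time
```

Because the problems for different destinations share only read-only network data, no synchronization between workers is needed, which is what makes this decomposition straightforward compared with splitting the network topology.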

Articles published on Design Of Parallel Algorithms

Improved QMRCGSTAB method in distributed parallel environments

Heuristics for work distribution of a homogeneous parallel dynamic programming scheme on heterogeneous systems

A parallel version of QMRCGSTAB method for large linear systems in distributed parallel environments

PLX: An Instruction Set Architecture and Testbed for Multimedia Information Processing

SCHEDULING ON LARGE SCALE DISTRIBUTED PLATFORMS: FROM MODELS TO IMPLEMENTATIONS

Efficient parallel algorithms and software for compressed octrees with applications to hierarchical methods

New Consideration on the Evaluation Model of Cluster Area Network

Parallel microgenetic algorithm design for photonic crystal and waveguide structures.

Emulations between QSM, BSP and LogP: a framework for general-purpose parallel algorithm design

BSPGRID: VARIABLE RESOURCES PARALLEL COMPUTATION AND MULTIPROGRAMMED PARALLELISM

Parallel Complexity of Matrix Multiplication

Parallel Algorithms for Dynamic Shortest Path Problems

A parallel algorithm for the eight-puzzle problem using analogical reasoning

Architecture independent parallel algorithm design: theory vs practice

Scalable Atomistic Simulation Algorithms for Materials Research

Quantitative performance analysis of the improved quasi-minimal residual method on massively distributed memory computers

Linear-scaling density-functional-theory calculations of electronic structure based on real-space grids: design, analysis, and scalability test of parallel algorithms

Identity-plus-row matrix decomposition and its application in design of parallel projection algorithms

Molecular models in computer simulation of liquid crystals

Merging on the BSP model
