Cluster Of Workstations Research Articles

Clusters of workstations are a popular platform for high-performance computing. For many parallel applications, efficient use of a fast interconnection network is essential for good performance. Several modern System Area Networks include programmable network interfaces that can be tailored to perform protocol tasks that otherwise would need to be done by the host processors. Finding the right trade-off between protocol processing at the host and the network interface is difficult in general. In this work, we systematically evaluate the performance of different implementations of a single, user-level communication interface. The implementations make different architectural assumptions about the reliability of the network and the capabilities of the network interface. The implementations differ accordingly in their division of protocol tasks between host software, network-interface firmware, and network hardware. Also, we investigate the effects of alternative data-transfer methods and multicast implementations, and we evaluate the influence of packet size. Using microbenchmarks, parallel-programming systems, and parallel applications, we assess the performance of the different implementations at multiple levels. We use two hardware platforms with different performance characteristics to validate our conclusions. We show how moving protocol tasks to a relatively slow network interface can yield both performance advantages and disadvantages, depending on specific characteristics of the application and the underlying parallel-programming system.

Read full abstract

AbstractIn treatment planning in a heavy particle cancer radiation facility which is utilized by multiple remote medical institutions, it is necessary to calculate accurately the three‐dimensional dose distribution within the body of the patient in order to achieve efficient treatment. However, this presents a problem, since a long processing time is needed, which degrades the efficiency of treatment planning. In order to handle this problem, the authors used a cluster system in which Alpha 21164A CPUs were connected over a 100 Mbit/s Ethernet to speed up the dose distribution calculation (Dose) and digital reconstructed X‐ray image regeneration (DRR) by parallel processing. Parallel calculation is performed by segmenting the calculation area. The communication time among processors is reduced by data compression, and the load is made uniform by assigning more than one area to a processor. As a result of evaluation experiments, calculations performed iteratively by varying parameters at the actual site of medical care were accelerated by using 10 processors, by a factor of 7 for Dose, and 9 for DRR. In other words, the speed was improved in proportion to the number of processors. This implies that processing which has required several tens of seconds can now be handled in 3 to 6 seconds, which is short enough for the user to wait before the terminal. The objective is to make the system usable by multiple remote medical institutions by refining system functions such as fault recovery in order to ensure its reliability as a calculation server. © 2004 Wiley Periodicals, Inc. Syst Comp Jpn, 35(8): 96–106, 2004; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/scj.10277

Read full abstract

Cluster Of Workstations Research Articles

Related Topics

Articles published on Cluster Of Workstations

Adaptive data parallel computing on workstation clusters

A networked client–server environment with CORBA interface for parallel FE analysis

Parallel Line Search in Method of Feasible Directions

Cluster communication protocols for parallel-programming systems

A hybrid self‐organizing maps and particle swarm optimization approach

Local DNA sequence alignment in a cluster of workstations: Algorithms and tools

A Study on Global and Local Optimization Techniques for TCAD Analysis Tasks

Optimizing the execution of multiple data analysis queries on parallel and distributed environments

Speed‐up of radiation treatment planning using a workstation cluster

Using Distributed Computers to Deterministically Approximate Higher Dimensional Convection Diffusion Equations

Modeling Clustered Task Graphs for Scheduling Large Parallel Programs in Distributed Systems

Inexact block Newton methods for solving nonlinear equations

Numerical Performance of the Distributed Vector Finite-Element Time-Domain Algorithm

Modeling the dewatering and depressurization of the Lihir open-pit gold mine, Papua New Guinea

The design of a distributed MATLAB-based environment for computing pseudospectra

Efficient resource management applied to master–worker applications

The design and implementation of a modular and extensible Java Virtual Machine

Evaluating linear recursive filters on clusters of workstations

Quick Matrix Multiplication on Clusters of Workstations

Optimization of Flapping Airfoils for Maximum Thrust and Propulsive Efficiency

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Cluster Of Workstations Research Articles

Related Topics

Articles published on Cluster Of Workstations

Adaptive data parallel computing on workstation clusters

A networked client–server environment with CORBA interface for parallel FE analysis

Parallel Line Search in Method of Feasible Directions

Cluster communication protocols for parallel-programming systems

A hybrid self‐organizing maps and particle swarm optimization approach

Local DNA sequence alignment in a cluster of workstations: Algorithms and tools

A Study on Global and Local Optimization Techniques for TCAD Analysis Tasks

Optimizing the execution of multiple data analysis queries on parallel and distributed environments

Speed‐up of radiation treatment planning using a workstation cluster

Using Distributed Computers to Deterministically Approximate Higher Dimensional Convection Diffusion Equations

Modeling Clustered Task Graphs for Scheduling Large Parallel Programs in Distributed Systems

Inexact block Newton methods for solving nonlinear equations

Numerical Performance of the Distributed Vector Finite-Element Time-Domain Algorithm

Modeling the dewatering and depressurization of the Lihir open-pit gold mine, Papua New Guinea

The design of a distributed MATLAB-based environment for computing pseudospectra

Efficient resource management applied to master–worker applications

The design and implementation of a modular and extensible Java Virtual Machine

Evaluating linear recursive filters on clusters of workstations

Quick Matrix Multiplication on Clusters of Workstations

Optimization of Flapping Airfoils for Maximum Thrust and Propulsive Efficiency